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(57) Abstract 

Nucleic acid molecules and proteins useful for regulating the expression of genes in eukaryotic cells and organisms are disclosed. 
Theinvention provides a transcriptional activator fusion protein which binds to tet operator sequences and stimulates transcription of a tet 
operator-linked gene in the presence, but not the absence, of tetracycline (or analogue thereof). The invention further provides transcriptional 
inhibitor fusion proteins which inhibit transcription of a tet operator-linked gene in a regulated manner. In one embaiiment, the inhibitor 
fusion protein binds to tet operator sequences in the absence, but not the presence, of tetracycline (or analogue). In another embodiment, the 
inhibitor fusion protein binds to tet operator sequences in the presence, but not the absence, of tetracycline (or analogue). The transcriptional 
activator and inhibitor fusion proteins of the invention can be used in combination to regulate expression of one or multiple tet operator- 
linked genes. Novel tet operator-containing transcription units which allow for coordinate or independent tetracycline-regulated expression 
of two or more genes by the transcriptional modulators of the invention are also disclosed. Kits including the components of the regulatory 
system are also encompassed by the invention. 
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TETRACY CLINE-REGULATED TRANSCRIPTIONAL MODULATORS 

Background of the Invention 

Functional analysis of cellular proteins is greatly facilitated through changes in the 
5 expression level of the corresponding gene for subsequent analysis of the accompanying 
phenotype. For this approach, an inducible expression system controlled by an external 
stimulus is desirable. Ideally such a system would not only mediate an "on/ofF status for 
gene expression but would also permit limited expression of a gene at a defined level 

Attempts to control gene activity have been made using various inducible eukaryotic 

1 0 promoters, such as those responsive to heavy metal ions (Mayo et al (1 982) Cell 22:99-1 08; 
Brinster et al. (1982) Nature 226:39-42; Searle et al. (1985) Mol. Cell Biol. 5:1480-1489), 
heat shock (Nouer et al. (1 991) in Heat Shock Response, e.d. Nouer, L. , CRC, Boca Raton , 
FL, ppl67-220) or hormones (Lee et al. (1981) Nature 224:228-232; Hynes et al. (1981) 
Proa Natl Acad Set USA 28:2038-2042; Klock et al. (1987) Nature 222:734-736; Israel & 

15 Kaufinan (1989) Nucl Acids Res. 12:2589-2604). However, these systems have generally 
suffered from one or both of the following problems: (1) the inducer (e,g, heavy metal ions, 
heat shock or steroid hormones) evokes pleiotropic effects, which can complicate analyses, 
and (2) many promoter systems exhibit high levels of basal activity in the non-induced state, 
which prevents shut-off the regulated gene and results in modest induction factors. 

20 An approach to circumventing these limitations is to introduce regulatory elements 

from evolutionary distant species such as Kcoli into higher eukaryotic cells with the 
anticipation that effectors which modulate such regulatory circuits will be inert to eukaryotic 
cellular physiology and, consequently, will not elicit pleiotropic effects in eukaryotic cells. 
For example, the Lac repressor (lacR)/operator/inducer system of E. coli functions in 

25 eukaryotic cells and has been used to regulate gene expression by three different approaches: 
(1) prevention of transcription initiation by properly placed lac operators at promoter sites 
(Hu & Davidson (1987) Ce//4&:555-566; Brown et al. (1987) Cell 42:603-612; Figge et al. 
(1988) Cell 52:713-722; Fuerst et al. (1989) Proc. Natl Acad Set USA S&2549-2553: 
Deuschle et al. (1989) Proc. Natl. Acad Sci. USA S&5400-5405); (2) blockage of 

30 transcribing RNA polymerase II during elongation by a LacR/operator complex (Deuschle et 
al. (1990) Science 24&:480-483); and (3) activation of a promoter responsive to a fusion 
between LacR and the activation domain of herpes simples virus (HSV) virion protein 16 
(VP16) (Labow et al. (1990) Mol Cell Biol 1^:3343-3356; Baim et al. (1991) Proc. Natl 
Acad Set USA 3&5072-5076). 

35 In one version of the Lac system, expression of lac operator-linked sequences is 

constitutively activated by a LacR-VP16 fusion protein and is turned off in the presence of 
isopropyl-P-D-thiogalactopyranoside (IPTG) (Labow et al. (1990), cited supra). In another 
version of the system, a lacR-VP16 variant is used which binds to lac operators in the 
presence of IPTG, which can be enhanced by increasing the temperature of the cells (Baim et 
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al (1991) cited supra). Hie utility of these lac systems in eukaryotic cells is limited, in part, 
because IPTG acts slowly and inefficiently in eukaryotic cells and must be used at 
concentrations which approach cytotoxic levels. Alternatively, use of a temperature shift to 
induce gene expression is likely to elicit pleiotropic effects in the cell, Thus .here .s a need 

of gene expression and in which the inducer is tolerated by eukaryotic cells without 

cytoxicity or pleiotropic effects. 

Components ofthe tetracycline CTO resistance system of £. coll have also been found 

to function in eukaryotic cells and have been used to regulate gene expression. For example, 

and represses gene transcription, has been expressed in plant cells at sufficiently high 
concentrations to repress transcription from a promoter containing tet operator sequences 
(Gatz,C et al. (1992) Plant J. 2:397-404). However, very high intracellular concentrations 
of TetR are necessary to keep gene expression down-regulated in cells, which may not be 

15 achievable in many situations, thus leading to "leakiness" inthesystem. 

In other studies, TetR has been fused to the activation domain of VP16 to create a 
tetracycline-controlled transcriptional activator (tTA) (Gossen, M. and Bujard, H (1992) 
ProcNall AcadSci. USA 22:5547-5551). The tTA fusion protein is regulated by 
tetracycline in the same manner as TetR i.e., tTA binds to tet operator sequences in the 

20 absence of tetracycline but not in the presence of tetracycline. Thus, in this system, in the 
continuous presence of Tc, gene expression is kept off, and to induce transcription, Tc is 
removed. 



j^ffln^rv nf the Invention 

One aspect ofthe invention pertains to an inducible regulatory system which utilizes 
components ofthe Tet repressor/operator/inducer system of prokaryotes to sUmulate gene 
expression in eukaryotic cells. In the system ofthe invention, transcription of a tet operator- 
linked nucleotide sequence is kept silent in the absence of tetracycline but can be rapidly and 
stronglyihducedmthepresenceo^ TranscnpUon is 

induced by a fusion protein composed of at least two polypeptides, a first polypeptide which 
binds to tet operator sequences in the presence of tetracycline or a tetracycline analogue and a 
second polypeptide which directly or indirectly activates transcription in eukaryotic cells. In 
a preferred embodiment, the first polypeptide of the fusion protein is a mutated Tet repressor 
which is regulated by Tc in a reverse manner from the wild type repressor, i.e., it binds to tet 
operator sequences in the presence rather than in the absence of Tc. Thus, in the absence of 
the inducing agent (Tc or aTc analogue), transcription of a tet operator-linked nucleotide 
sequence remains uninduced. In the presence ofthe inducing agent, transcription of the .tet 
operator-linked nucleotide sequence is stimulated by the transactivator fusion protein ofthe 



25 



30 



35 



invention. 
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The inducible regulatory system of the invention has the advantageous properties that 
induction of gene expression is rapid, efficient and strong (e.g., typically between 1000 to 
2000-fold; up to a 20,000-fold increase in expression has been observed) and the inducing 
agent does not have to be continuously present. Moreover, the inducing agent does not cause 
5 pleiotropic effects or cytotoxicity in eukaryotic cells. The inducible regulatory system of the 
invention can be applied to regulation of gene expression in cells either in vitro or in vivo, 
and may be particularly useful for gene therapy applications and for expression of gene 
products in transgenic and homologous recombinant organisms (e.g., animals and plants). 
The inducible regulatory system of the invention involves at least two components: a 

10 tetracycline-inducible transcriptional activator and a target transcription unit to be regulated. 
Accordingly, one aspect of the invention pertains to a Tc-inducible transcriptional activator 
("transactivator") fusion protein, and nucleic acid (e.g., DNA) encoding the fusion protein. A 
preferred fusion protein comprises a TnlO-encoded Tet repressor that is mutated at at least 
one amino acid position selected from amino acid positions 71, 95, 101 and/or 102, 

1 5 operatively linked to a herpes simplex virus virion protein 1 6 (VP 1 6) activation domain (the 
nucleotide and amino acid sequences of such a fusion protein are shown in SEQ ID NO: 1 
and 2, respectively). In one embodiment, the activation domain of the fusion protein includes 
about 127 C-terminal amino acids of VP 16 (e.g., the fusion protein of SEQ ID NO: 2). In 
another embodiment, the activation domain includes at least one copy of about 1 1 C-terminal 

20 amino acids of VP 16 (e.g., the amino acid sequence of which is shown in SEQ ID NO: 4). 
Other mutated Tet repressors and transcriptional activation domains having the requisite 
functional activities are within the scope of the invention. Additionally, the transactivator 
fusion protein can include a third polypeptide which promotes transport of the fusion protein 
to a cell nucleus. For example, a nuclear localization signal (e.g., having an amino acid 

25 sequence shown in SEQ ID NO: 5) can be incorporated into the fusion protein. The 
invention further provides recombinant vectors and host cells comprising nucleic acid 
encoding a transactivator fusion protein of the invention. The invention still further provides 
transgenic and homologous recombinant organisms comprising nucleic acid encoding a 
transactivator fusion protein of the invention. 

30 A transactivator of the invention is used to regulate the transcription of a target 

transcription unit composed of a nucleotide sequence to be transcribed operatively linked to a 
a minimal promoter sequence and at least one tet operator sequence. The nucleotide 
sequence to be transcribed may encode a protein of interest or an active RNA molecule (e.g., 
antisense RNA molecule or ribozyme) and may be an exogenous or endogenous nucleotide 

35 sequence. A first nucleic acid encoding a transactivator of the invention and a second 

nucleic acid comprising a target transcription unit for the transactivator can be incorporated 
into a nucleic acid composition. The nucleic acid composition can be introduced into host 
cells or organisms (e.g., transgenic and homologous recombinant animals and plants) to allow 
for tetracycline-inducible expression of the target nucleotide sequence to be transcribed. 
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linked to one or more tet operator sequences. The transcriptional inhibitors of the invention 
comprise a fusion protein composed of at least two polypeptides, a first polypeptide that 
binds to tet operator sequences and a heterologous second polypeptide that directly or 
indirectly inhibits transcription in eukaryotic cells. The heterologous the second polypeptide 
5 is derived from a different protein than the first polypeptide. Because the fusion proteins of 
the invention include a eukaryotic transcriptional silencer domain, they are anticipated to be 
more efficient at repressing transcription in eukaryotic cells than is a Tet Repressor alone. 

In one embodiment of the invention, the first polypeptide of the inhibitor fusion 
protein binds to tet operator sequences in the absence but not the presence of tetracycline (Tc) 

10 or an analogue thereof (e.g., the first polypeptide is preferably a Tet repressor, such as a 

Tnl 0-derived Tet repressor having an amino acid sequence shown in SEQ ID NO: 1 7). In the 
absence of tetracycline (or Tc analogue), this fusion protein binds to tet operator sequences 
operatively linked to a gene of interest, thereby inhibiting transcription of the gene of interest. 
In another embodiment, the first polypeptide binds to tet operator sequences in the presence 

1 5 but not the absence of tetracycline (e.g., the first polypeptide is preferably a mutated Tet 
repressor, such as a Tnl 0-derived Tet repressor having an amino acid substitution at position 
71, 95, 101 and/or 102). Preferably, the first polypeptide has an amino acid sequence shown 
in SEQ ID NO: 1 9. In the presence of tetracycline (or Tc analogue), this fusion protein binds 
to tet operator sequences operatively linked to a gene of interest, thereby inhibiting 

20 transcription of the gene of interest 

The second polypeptide can be a transcriptional "silencer" domain from a protein such 
as the v-erbA oncogene product, the Drosophila Krueppel protein, the retinoic acid receptor 
alpha, the thyroid hormone receptor alpha, the yeast Ssn6/Tupl protein complex, the 
Drosophila protein even-skipped, SIR1, NePl, the Drosophila dorsal protein, TSF3, SFI, the 

25 Drosophila hunchback protein, the Drosophila knirps protein, WT1 , Oct-2. 1 , the Drosophila 
engrailed protein, E4BP4 or ZF5. Preferred silencer domains include amino acid residues 
403-466 of Krueppel (shown in SEQ ID NO: 21) and amino acid residues 364-635 of v-erbA 
(shown in SEQ ID NO: 23). 

The fusion proteins of the invention may further comprise additional polypeptides, 

30 such as a third polypeptide which promotes transport of the fusion protein into a cell nucleus 
(i.e., a nuclear transport amino acid sequence). 

This invention further provides isolated nucleic acid molecules encoding the 
transcriptional inhibitor fusion proteins of the invention and recombinant expression vectors 
containing these nucleic acid molecules in a form suitable for expression of the encoded 

35 transcriptional inhibitor fusion protein in a host cell. The invention still further provides host 
cells into which a recombinant expression vectors of the invention has been introduced. 
Thus, a transcriptional inhibitor fusion protein is expressed in these host cells. The host cell 
can be, for example, a mammalian cell (e.g., a human cell), a yeast cell, a fungal cell or an 
insect cell. Moreover, the host cell can be a fertilized non-human oocyte, in which case the 
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host cell can be used to create a transgenic organism having cells that express the 

bITesigned to allow homologous recombination between the nucle.c acd encodmg the 
^teinandatargetge^inahostcell. Such nomologous^mb^nv^rs can 
5 he used to create homologous recombinant animals that express a fus.on protem of the 

"^preferred embodiment, the host cells (or cells of a host organism) also contain a 

nucleotide U^^^^r^ 
eg.,ageneofinterest whose expression can be related by Tc or ■« ^*««* To 

10 eglate^^ 

ri^enthetLs^ 

the absence ofTc, the concentration of Tc in contact with the cells rs decreed to thereby 
^tlscdptionofme.operator^ 
15 in the presence of Tc, then Tc can be removed from the culture medium to mhib t 

^Snofthegeneofinterest). M^V^^oM^^ 
proteinbindsto^operator sequences in the presence ofTctheconcentraUonofTcm 
IctwifcthecoUsisine^^ 

gene of interest (e.g., if cells are first cultured in the absence of Tc, then Tc can be added to 
20 ^c^w^robMAtrnM^^^f^ 

The transcriptional inhibitor fusion proteins of the invenUon are useful for mkbitmg 

gene expression in a variety of situations, as described ^^^^.^ 
pLferre^ embodiment, transcripuonofa^operato^^ 
by a combination of tetracycline-regulated transcriptional inhibitor and actrvator fusmn 
25 pLeins in the same host coll to allow for precise control of the express^ leve of the gene 
o^erest For example, an activator fusion protein that binds to tetO only m the presence of 

LhostceUth^ — a^linkedgeneof interest In me absence of Tc, basal levels of 

Option of the gene of interest «, inhibited by the ^rte™vro«^V m 

30 conta/oftheceUwi*^ 

theactivatorfusionprotein. T*^**Mtot^^«*^«> 
also be used in combination to regulate the expression of multiple */0-lmked genes of 

^Novel kits for regulating the expression of a gene of interest are also within the scope 
35 of the invention. In one embodiment, the kits of the invention can include at least one nuclei 
acidmolecmeencc^gatrar^vatorfusionprotem 

t^enptionunitintowhichageneofinterst can be cloned such mat the gene ,s operauvely 
Unked to a te, operator sequences). Alternatively or additionally, the kits can mclude a 
nucleic acid molecule encoding a transcriptional inhibitor fusion protein of the mventmn. 
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Target transcription units into which multiple genes of interest can be cloned, e.g., for 
coordinate or independent regulation, can also be included in kits of the invention. 
Moreover, at least one tetracycline or tetracycline analogue may be included in the kits. 

5 Britf Description of th$ Brasonga 

Figure 1 is a bar graph depicting the stimulation of luciferase activity in HR5-C 1 1 
cells by tetracycline and different tetracycline analogues (1 ng/ml fx.). Cells were grown in 
the absence (-) or presence of the indicated tetracyclines for 3 days before luciferase activity 
was determined. Each solid and hatched bar represents the luciferase activity of a single 
10 culture dish. 

Figure 2 is a graph depicting the relative luciferase activity in HR5-C1 1 cells when 
incubated with different concentrations of doxycycline. The results of three independent 
experiments are shown. 

Figure 5 is a graph depicting the kinetics of induction of luciferase activity in HR5- 
15 CI 1 cells by doxycycline. HR5-C1 1 cultures were exp<?sed to 1 jig/ml of doxycycline and 
luciferase activity was measured after different time intervals; (•) cultures containing 
doxycycline, (o) cultures grown in the absence of antibiotic. 

Figure 4 shows the amino acid sequences of various classes of Tet repressors, 
illustrating the homology between the amino acid sequences of different classes of Tet 
20 repressors, as compared to class B Tet repressors (e.g., TnlO-derived). Amino acid positions 
in other classes of Tet repressors that are identical to class B are indicated by a dash. 

Figure 5 shows the nucleotide sequences of tet operators of different classes: class A 
(SEQ ID NO: 1 1), class B (SEQ ID NO: 12), class C (SEQ ID NO: 13), class D (SEQ ID 
NO: 14) and class E (SEQ ID NO: 15). 
25 Figure 6 is a schematic diagram of a bidirectional promoter construct for coordinate 

regulation of two genes of interest operatively linked to the same tet operators for regulation 
by a tetracycline-regulated transcriptional activator. 

Figure 7 A (SEQ ID NO: 6) shows the nucleotide sequence of a bidirectional promoter 
region for coordinate regulation of two genes of interest by a tetracycline-regulated 
30 transcriptional activator. 

Figure IB (SEQ ID NO: 7) shows the nucleotide sequence of a bidirectional promoter 
region for coordinate regulation of two genes of interest by a tetracycline-regulated 
transcriptional activator. 

Figure 8 is two graphs depicting coordinate expression of luciferase and P- 
35 galactosidase activity by a tetracycline-regulated transcriptional activator. 

Figure 9A-B are schematic diagrams of self-regulating promoters for expression of 
tetracycline-regulated transcriptional activators (tTA). Panel A illustrates self-regulation of 
expression of a wild-type Tet repressor-containing transactivator fusion protein that binds to 
tet operators in the absence of Tc. Panel B illustrates self-regulation of expression of a 
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transgenicaniinalscarryingthelmferasereporterge^ transg . 

of doxycyclinc (dark columns in middle) or in the presence of doxycychne 0ight 



15 columns at left). 



^^vention^tains to nucleic acid molecules and protems which can be used to 

Zoonents- A gene which is operativery linked to a regulatory sequence and a protein 

IrTither activates or inhibits transcription of the gene. The system of the 
SeTn^n^ 

25 ^o^ 

™cline, or an analogue thereof Thus, in a host cell, transcnp.on of a gene ocxsatively 

^byaStteconc^onoftetracyc^ 
^aL^ 

ceasingtoadministertetracyclinetoahostorg^ A „ fildftn 
The invention^* pertai*^^^ 
35 proteinoftheinvention. In addition to allowing for regulation of a single 

^oflerest, the invention also provides novel transcription units containing two or nwe 
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components of the regulatory system described herein, are also encompassed by the 
invention. 

In the following subsections, the nucleic acids and proteins comprising the 
components of the inducible regulatory system of the invention, and their interrelationship, 
5 are discussed in greater detail. The subsections are as follows: 

I. Tetracycline-Inducible Transcriptional Activators 

A. The first polypeptide of the transactivator fusion protein 

B. The second polypeptide of the transactivator fusion protein 
10 C. A third polypeptide of the transactivator fusion protein 



II. Expression of a Transactivator Fusion Protein 

A. Expression vectors 

B. Host cells 

15 C. Introduction of nucleic acid into host cells 

D. Transgenic Organisms 

E. Homologous Recombinant Organisms 

III. Target Transcription Units Regulated by a Tetracycline-Inducible Transactivator 
20 A. Regulation of expression of tet operator-linked nucleotide sequences 

B. Coordinate regulation of two nucleotide sequences 

C. Independent regulation ofmulitple nucleotide sequences 

D. Combined coordinate and independent regulation of multiple nucleotide 
sequences 

25 

IV. Tetracycline-Regulated Transcriptional Inhibitors 

A. The first polypeptide of the trar c critional inhibitor fusion protein 

B. The second polypeptide of the transcritional inhibitor fusion protein 

C. A third polypeptide of the transcritional inhibitor fusion protein 
30 D. Expression of the transcriptional inhibitor fusion protein 



V. Kits of the Invention 



VI. Regulation of Gene Expression by Tetracycline or Analogues Thereof 
35 A. Stimulation of gene expression by transactivator fusion proteins 

B. Inhibition of gene expression by transcriptional inhibitor fusion proteins 

C. Combined positive and negative regulation of gene expression 
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VII. Applications of the Invention 

A. Gene Therapy 

B. Production of Proteins in Vitro 

C. Production of Proteins in Vivo 

5 . D. Animal Models of Human Disease 

E. Production of Stable Cell Lines for Cloning 

t Tetracycline -lndusifrk Imisgiptional Activators 

S I — to describe al leost two potypoprido, W io=»y fan dtffen. 
^^p^bt^-mwoop^tot^f^ly^^^ 

^Ztooa* XTa^yorioo^soch^oftaU^thot™^ 
other in-frame). 

25 * Tk^first polypeptide of the trar^aMarJusionprotein 

The Levator fusion protein of the invention is composed, m part, of a fir* 

thereof. Ttaf-po^^^^-^'ir^,. 
ren Jl -rte term "mutated Tet repressor" is intended to include polypeptides havmg an 

an^ acid difference from the wUd-type Tet repressor. The term -"-type Tet rep^ssor" 

operator sequences in prokaryotic cells in the absence of Tc. ^ amn«, ac^erenc^ ) 
bTeen Mutated Tet repressor and a wild-type Tet repress may be subsntuuon of one or 
35 more amino acids, deletion of one or more amino acids or addition o one or more annno 
acids. Ita^!*^**^**^^!"^ ° 
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tetracycline than a wild-type Tet repressor, i.e., the mutated Tet repressor binds to a tet 
operator sequence only the presence of Tc (or Tc analogue) rather than in the absence of Tc. 

In a preferred embodiment, a mutated Tet repressor having the functional properties 
described above is created by substitution of amino acid residues in the sequence of a wild- 
5 type Tet repressor. For example, as described in Example 1 , a Tnl 0-derived Tet repressor 
having amino acid substitutions at amino acid positions 71, 95, 101 and 102 has the desired 
functional properties and thus can be used as the first polypeptide in the transactivator fusion 
protein of the invention. The amino acid sequence of this mutated Tet repressor is shown in 
SEQ ID NO: 2 (positions 1-207). In one embodiment of the mutated Tet repressor, position 

10 71 is mutated from glutamic acid to lysine, position 95 is mutated from aspartic acid to 
asparagine, position 101 is mutated from leucine to serine and position 102 is mutated from 
glycine to aspartic acid, although the invention is not limited to these particular mutations. 
Mutation of fewer than all four of these amino acid positions may be sufficient to acheive a 
Tet repressor with the desired functional properties. Accordingly, a Tet repressor is 

1 5 preferably mutated at at least one of these positions. Other amino acid substitutions, 
deletions or additions at these or other amino acid positions which retain the desired 
functional properties of the mutated Tet repressor are within the scope of the invention. The 
crystal structure of a Tet repressor-tetracycline complex, as described in Hinrichs, W. et al. 
(1994) Science 264:418-420, can be used for rational design of mutated Tet repressors. 

20 Based upon this structure, amino acid position 71 is located outside the tetracyline binding 
pocket, suggesting mutation at this site may not be necessary to achieve the desired 
functional properties of a mutated Tet repressor of the invention. In contrast, amino acid 
positions 95, 101 and 102 are located within the conserved tetracyline binding pocket Thus, 
the tetracycline binding pocket of a Tet repressor may be targeted for mutation to create a 

25 mutated Tet repressor of the invention. 

Additional mutated Tet repressors for incorporation into a fusion protein of the 
invention can be created according to the teachings of the invention. A number of different 
classes of Tet repressors have been described, e.g., A, B, C, D and E (of which theTnlO- 
encoded repressor is a class B repressor). The amino acid sequences of the different classes 

30 of Tet repressors share a high degree of homology (i.e., 40-60 % across the length of the 
proteins), including in the region encompassing the above-described mutations. The amino 
acid sequences of various classes of Tet repressors are shown and compared in Figure 4, and 
are also described in Tovar, K. et al. (1988) MoL Gen. Genet. 215:76-80. Accordingly, 
equivalent mutations to those described above for the Tnl 0-derived Tet repressor can be 

35 made in other classes of Tet repressors for inclusion in a fusion protein of the invention. For 
example, amino acid position 95, which is an aspartic acid in all five repressor classes, can be 
mutated to asparagine in any class of repressor. Similarly, position 102, which is glycine in 
all five repressor classes, can be mutated to aspartic acid in any class of repressor. 
Additional suitable equivalent mutations will be apparent to those skilled in the art and can be 
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Alternatively, a mutated class A Tet repressor can be used with a class A tet operator 
sequence, and so on for the other classes of Tet repressor/operators. 

Another approach for creating a mutated Tet repressor which binds to a class A tet 
operator is to further mutate the already mutated TnlO-derived Tet repressor described herein 
5 (a class B repressor) such that it no longer binds efficiently to a class B type operator but 
instead binds efficiently to a class A type operator. It has been found that nucleotide position 
6 of class A or B type operators is the critical nucleotide for recognition of the operator by its 
complimentary repressor (position 6 is a G/C pair in class B operators and an A/T pair in 
class A operators) (see Wissman et al. (1988) J. Uol Biol 202:397-406). It has also been 

1 0 found that amino acid position 40 of a class A or class B Tet repressor is the critical amino 
acid residue for recognition of position 6 of the operator (amino acid position 40 is a 
threonine in class B repressors but is an alanine in class A repressors). It still further has 
been found that substitution of Thr40 of a class B repressor with Ala alters its binding 
specificity such that the repressor can now bind a class A operator (similarly, substitution of 

IS Ala40 of a class A repressor with Thr alters its binding specificity such that the repressor can 
now bind a class B operator) (see Altschmied et al. (1988) EMBOl 2:401 1-4017). 
Accordingly, one can alter the binding specificity of the mutated TnlO-derived Tet repressor 
disclosed herein by additionally changing amino acid residue 40 from Thr to Ala by standard 
molecular biology techniques (e.g., site directed mutagenesis). 

20 A mutated Tet repressor having specific mutations (e.g., at positions 71 , 95, 101 

and/or 1 02, as described above) can be created by introducing nucleotide changes into a 
nucleic acid encoding a wild-type repressor by standard molecular biology techniques, e.g. 
site directed mutagenesis or PCR-mediated mutagenesis using oligonucleotide primers 
incorporating the nucleotide mutations. Alternatively, when a mutated Tet repressor is 

25 identified by selection from a library, the mutated nucleic acid can be recovered from the 
library vector. To create a transactivator fusion protein of the invention, a nucleic acid 
encoding a mutated Tet repressor is then ligated in-frame to another nucleic acid encoding a 
transcriptional activation domain and the fusion construct is incorporated into a recombinant 
expression vector. The transactivator fusion protein can be expressed by introducing the 

30 recombinant expression vector into a host cell or animal. 

B. The second polypeptide of the transactivator fusion protein 

The first polypeptide of the transactivator fusion protein is operatively linked to a 
second polypeptide which directly or indirectly activates transcription in eukaryotic cells. To 
35 operatively link the first and second polypeptides, typically nucleotide sequences encoding 
the first and second polypeptides are ligated to each other in-frame to create a chimeric gene 
encoding a fusion protein, although the first and second polypeptides can be operatively 
linked by other means that preserve die function of each polypeptide (e.g., chemically 
crosslinked). In a preferred embodiment, the second polypeptide of the transactivator itself 
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the scope of the invention. The transcriptional activation ability of a polypeptide can be 
assayed by linking the polypeptide to another polypeptide having DNA binding activity and 
determining the amount of transcription of a target sequence that is stimulated by the fusion 
protein. For example, a standard assay used in the art utilizes a fusion protein of a putative 

5 transcriptional activation domain and a GAL4 DNA binding domain (e.g., amino acid 
residues 1 -93). This fusion protein is then used to stimulate expression of a reporter gene 
linked to GAL4 binding sites (see e.g., Seipel, K. et al. (1992) EMBOJ. 11:4961-4968 and 
references cited therein). 

In another embodiment, the second polypeptide of the fusion protein indirectly 

1 0 activates transcription by recruiting a transcriptional activator to interact with the fusion 
protein. For example, a mutated tetR of the invention can be fused to a polypeptide domain 
(e.g., a dimerization domain) capable of mediating a protein-protein interaction with a 
transcriptional activator protein, such as an endogenous activator present in a host cell. It has 
been demonstrated that functional associations between DNA binding domains and 

15 transactivation domains need not be covalent (see e.g., Fields and Song (1989) Nature 
24Q:245-247; Chien et al. (1991) Proc. Natl Acad ScL USA £4:9578-9582; Gyuris et al. 
(1993) Cell 25:791-803; and Zervos, A.S. (1993) Cell 22:223-232). Accordingly, the second 
polypeptide of the fusion protein may not direcdy activate transcription but rather may form a 
stable interaction with an endogenous polypeptide bearing a compatible protein-protein 

20 interaction domain and transactivation domain. Examples of suitable interaction (or 

dimerization) domains-include leucine zippers (Landschulz et al. (1989) Science 242:1681- 
1688), helix-loop-helix domains (Murre, C. et al. (1989) Cell 54:537-544) and zinc finger 
domains (Frankel, A.D. et al. (1988) Science 24Q:70-73). Interaction of a dimerization 
domain present in the fusion protein with an endogeneous nuclear factor results in 

25 recruitment of the transactivation domain of the nuclear factor to the fusion protein, and 
- thereby to a tet operator sequence to which the fusion protein is bound. 

C. A third polypeptide of the transactiwtor fusion protein 

In addition to a mutated Tet repressor and a transcriptional activation domain, a fusion 

30 protein of the invention can contain an operatively linked third polypeptide which promotes 
transport of the fusion protein to a cell nucleus. Amino acid sequences which, when included 
in a protein, function to promote transport of the protein to the nucleus are known in the art 
and are termed nuclear localization signals (NLS). Nuclear localization signals typically are 
composed of a stretch of basic amino acids. When attached to a heterologous protein (e.g., a 

35 fusion protein of the invention), the nuclear localization signal promotes transport of the 

protein to a cell nucleus. The nuclear localization signal is attached to a heterologous protein 
such that it is exposed on the protein surface and does not interfere with the function of the 
protein. Preferably, the NLS is attached to one end of the protein, e.g. the N-terminus. The 
amino acid sequence of a non-limiting exampleof an NLS that can be included in a fusion 
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Altematively, the regulatoiy sequences of the recombinant expression vector can 
direct expression of the fusion protein preferentially in a particular cell type, i.e., tissue- 
specific regulatory elements can be used Non-limiting examples of tissue-specific promoters 
which can be used include the albumin promoter (liver-specific; Pinkert et al. (1987) Genes 

5 Dev. 1:268-277), lymphoid-specific promoters (Calame and Eaton (1 988) Adv. Immunol 
42:235-275), in particular promoters of T cell receptors (Winoto and Baltimore (1989) 
EMBOJ. fi:729-733) and immunoglobulins (Banerji et al. (1983) Cell 22:729-740; Queen 
and Baltimore (1983) Cell 22^741-748), neuron-specific promoters (e.g., the neurofilament 
promoter, Byrne and Ruddle (1989) Proc. Natl Acad ScL USA £6:5473-5477), pancreas- 

1 0 specific promoters (Edlund et al. (1 985) Science 22&912-91 6), and mammary gland-specific 
promoters (e.g., milk whey promoter, U.S. Patent No. 4,873,316 and European Application 
Publication No. 264,166). Developmentally-regulated promoters are also encompassed, for 
example the murine hox promoters (Kessel and Gruss (1990) Science 242:374-379) and the 
a-fetoprotein promoter (Campes and Tilghman (1989) Genes Dev. 2:537-546). 

1 5 Alternatively, a self-regulating construct encoding a transactivator fusion protein can 

be created To accomplish this, nucleic acid encoding the fusion protein is operatively linked 
to a minimal promoter sequence and at least one let operator sequence. For example, the 
nucleic acid of SEQ ID NO: 1 can be linked to a promoter having a nucleotide sequence 
shown in SEQ ID NO: 8, 9 or 10 (the nucleic acids of SEQ ID NOs: 8 and 9 comprise a 

20 minimal CMV promoter and ten tet operators; the nucleic acids of SEQ ID NO: 10 comprises 
a TK promoter and ten tet operators). A schematic diagram of such a self-regulating 
construct is shown in Figure 9B. When this nucleic acid is introduced into a cell (e.g., in a 
recombinant expression vector), a small amount of basal transcription of the transactivator 
gene is likely to occur due to "leakiness". In the presence of Tc (or analogue thereof) this 

25 small amount of the transactivator fusion protein will bind to the tet operator sequence(s) 
upstream of the nucleotide sequence encoding the transactivator and stimulate additional 
transcription of the nucleotide sequence encoding the transactivator, thereby leading to 
further production of the transactivator fusion protein in the cell. It will be appreciated by 
those skilled in the art that such a self-regulating promoter can also be used in conjunction 

30 with other tetracycline-regulated transactivators, such as the wild-type Tet repressor fusion 
protein (tTA) described in Gossen, M. and Bujard, H. (1992) Proc. Natl Acad Sci. USA 
22:5547-5551 , which binds to tet operators in the absence of Tc (as illustrated in Figure 9A). 
When used in conjunction with this transactivator, self-regulated transcription of the 
nucleotide sequence encoding this transactivator is stimulated in the absence of Tc. The 

35 plasmid pUHDl 5-3, which comprises nucleotide sequences encoding the tTA described in 
Gossen and Bujard (1 992), cited supra, operatively linked to a self-regulating promoter, has 
been deposited on July 8, 1994 under the provisions of the Budapest Treaty at the Deutsche 
Sammlung Von Mikroorganismen und ZellKulturen GmbH (DSM) in Braunschweig, 
Germany and assigned deposit number DSM 9280. 
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modified to contain nucleic acid encoding a transactivator fusion protein. Moreover, plant 
cells can be modified to create transgenic plants. 

The invention is broadly applicable and encompasses non-mammalian eukaryotic 
cells as well, including insect (e.g,. Sp. frugiperda), yeast (e.g., S. cerevisiae, S. pombe, P. 
5 . pastoris, K. lactis, H. polymorpha; as generally reviewed by Fleer, R. (1 992) Current Opinion 
in Biotechnology K5):486-496)), fungal and plant cells. Examples of vectors for expression 
in yeast S. cerfvisae include pYepSecl (Baldari. si aL, (1987) Embo J. &229-234), pMFa 
(Kurjan and Herskowitz, (1982) C<?//2Q:933-943), pJRY88 (Schultztf aL, (1987) Gene 
54:1 13-123), and pYES2 (Invitrogen Corporation, San Diego, CA). The fusion protein can be 
10 expressed in insect cells using baculovirus expression vectors (e.g., as described in O'Reilly 
et al. (1992) Baculovirus Expression Vectors: A Laboratory Manual, Stockton Press). 
Baculovirus vectors available for expression of proteins in cultured insect cells (e.g., SF 9 
cells) include the pAc series (Smith etaL, (1983) Mol Cell Biol 2:2156-2165) and the pVL 
series (Lucklow, V.A., and Summers, M.D., (1989) Virology 12Q:31-39). 

15 

C Introduction of Nucleic Acid into a Host Cell 

Nucleic acid encoding the fusion protein can be introduced into a host cell by standard 
techniques for transfecting eukaryotic cells. The term "transfecting" or "transfection" is 
intended to encompass all conventional techniques for introducing nucleic acid into host 

20 cells, including calcium phosphate co-precipitation, DEAE-dextran-mediated transfection, 
lipofection, electroporation and microinjection. Suitable methods for transfecting host cells 
can be found in Sambrook fit aL {Molecular Cloning: A Laboratory Manual, 2nd Edition, 
Cold Spring Harbor Laboratory press (1989)), and other laboratory textbooks. 

The number of host cells transformed with a nucleic acid of the invention will depend, 

25 at least in part, upon the type of recombinant expression vector used and the type of 

transfection technique used. Nucleic acid can be introduced into a host cell transiently, or 
more typically, for long term regulation of gene expression, the nucleic acid is stably 
integrated into the genome of the host cell or remains as a stable episome in the host cell. 
Plasmid vectors introduced into mammalian cells are typically integrated into host cell DNA 

30 at only a low frequency. In order to identify these integrants, a gene that contains a selectable 
marker (e.g., drug resistance) is generally introduced into the host cells along with the nucleic 
acid of interest Preferred selectable markers include those which confer resistance to certain 
drugs, such as G418 and hygromycin. Selectable markers can be introduced on a separate 
plasmid from the nucleic acid of interest or, are introduced on the same plasmid. Host cells 

35 transfected with a nucleic acid of the invention (e.g., a recombinant expression vector) and a 
gene for a selectable marker can be identified by selecting for cells using the selectable 
marker. For example, if the selectable marker encodes a gene conferring neomycin 
resistance, host cells which have taken up nucleic acid can be selected with G418. Cells that 
have incorporated the selectable marker gene will survive, while the other cells die. 
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increase the efficiency of expression of the transgene. Methods for generating transgenic 
animals, particularly animals such as mice, have become conventional in the art and are 
described, for example, in U.S. Patent Nos. 4,736,866 and 4,870,009 and Hogan, B. et aL, 

(1986) A Laboratory Manual, Cold Spring Harbor, New York, Cold Spring Harbor 

5 Laboratory . A transgenic founder animal can be used to breed additional animals carrying 
the transgene. Transgenic animals carrying a transgene encoding the fusion protein of the 
invention can further be bred to other transgenic animals carrying other transgenes, e.g., to a 
transgenic animal which contains a gene operatively linked to a tet operator sequence 
(discussed in more detail in Section III below). 

10 It will be appreciated that, in addition to transgenic animals, the regulatory system 

described herein can be applied to other transgenic organisms, such as transgenic plants. 
Transgenic plants can be made by conventional techniques known in the art. Accordingly, 
the invention encompasses non-human transgenic organisms, including animals and plants, 
that contains cells which express the transactivator fusion protein of the invention (i.e., a 

1 5 nucleic acid encoding the transactivator is incorporated into one or more chromosomes in 
cells of the transgenic organism). 

E. Homologous Recombinant Organisms 

The invention also provides a homologous recombinant non-human organism 

20 expressing the fusion protein of the invention. The term "homologous recombinant 

organism" as used herein is intended to describe an organism, e.g. animal or plant, containing 
a gene which has been modified by homologous recombination between the gene and a DNA 
molecule introduced into a cell of the animal, e.g., an embryonic cell of the animal. In one 
embodiment, the non-human animal is a mouse, although the invention is not limited thereto. 

25 An animal can be created in which nucleic acid encoding the fusion protein has been 
introduced into a specific site of the genome, i.e., the nucleic acid has homologously 
recombined with an endogenous gene. 

To create such a homologous recombinant animal, a vector is prepared which contains 
DNA encoding the fusion protein flanked at its 5* and 3' ends by additional nucleic acid of a 

30 eukaryotic gene at which homologous recombination is to occur. The additional nucleic acid 
flanking that encoding the fusion protein is of sufficient length for successful homologous 
recombination with the eukaryotic gene. Typically, several kilobases of flanking DNA (both 
at the 5* and 3* ends) are included in the vector (see e.g., Thomas, K.R. and Capecchi, M. R. 

(1987) Cell 51:503 for a description ofhomologous recombination vectors). The vectoris 
35 introduced into an embryonic stem cell line (e.g., by electroporation) and cells in which the 

introduced DNA has homologously recombined with the endogenous DNA are selected (see 
e.g., Li, E. et aL (1992) Cell £2:915). The selected cells are thai injected into a blastocyst of 
an animal (e.g., a mouse) to form aggregation chimeras (see e.g., Bradley, A. in 
Teratocarcinomas and Embryonic Stem Cells: A Practical Approach, EJ. Robertson, ed. 
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one embodiment, the minimal promoter is from the human cytomegalovirus (as described in 
Boshart et al, (1985) Cell 11:521-530). Preferably, nucleotide positions between about +75 
to -53 and +75 to -3 1 are used. Other suitable minimal promoters are known in the art or can 
be identified by standard techniques. For example, a functional promoter which activates 
5 transcription of a contiguously linked reporter gene (e.g., chloramphenicol acetyl transferase, 
p-galactosidase or luciferase) can be progressively deleted until it no longer activates 
expression of the reporter gene alone but rather requires the presence of an additional 
regulatory sequence(s). 

Within a transcription unit, the target nucleotide sequence (including the transcribed 

1 0 nucleotide sequence and its upstream minimal promoter sequence) is operatively linked to at 
least one tet operator sequence. In a typical configuration, the tet operator sequence^) is 
operatively linked upstream (i.e., 5 1 ) of the minimal promoter sequence through a 
phosphodiester bond at a suitable distance to allow for transcription of the target nucleotide 
sequence upon binding of a regulatory protein (e.g., the transactivator fusion protein) to the 

1 5 tet operator sequence. That is, the transcription unit is comprised of, in a 5' to 3' direction: tet 
operator sequences) - a minimal promoter - a transcribed nucleotide sequence. It will be 
appreciated by those skilled in the art that there is some flexibility in the permissable distance 
between the tet operator sequences) and the minimal promoter, although typically the tet 
operator sequences will be located within about 200-400 base pairs upstream of the minimal 

20 promoter. 

The nucleotide sequences of examples of tef-regulated promoters, containing tet 
operator sequences linked to a minimal promoter, that can be used in the invention are shown 
in SEQ ID NO: 8-10. The nucleotide sequences of SEQ ID NOs: 8 and 9 comprise a 
cytomegalovirus minimal promoter linked to ten tet operator sequences; the two nucleotide 

25 sequences differ in the distance between the operators and the first transcribed nucleotide. 
The nucleotide sequence of SEQ ID NO: 10 comprises a herpes simplex virus minimal tk 
promoter linked to ten tet operator sequences. The promoter of SEQ ID NO: 8 corresponds 
to PhCMV*- 1 * described in Gossen, M. and Bujard, H. (1992) Proc. Natl Acad Sci. USA 
52:5547-5551. The promoter of SEQ ID NO: 9 corresponds to PhCMV*- 2 * *&> described in 

30 Gossen, M. and Bujard, H, cited supra. 

Alternatively, since regulatory elements have been observed in the art to function 
downstream of sequences to be transcribed, it is likely that the tet operator sequences) can be 
operatively linked downstream (i.e., 3 1 ) of the transcribed nucleotide sequence. Thus, in this 
configuration, the transcription unit is comprised of, in a 5' to 3' direction: a minimal 

35 promoter - a transcribed nucleotide sequence - tet operator sequences). Again, it will be 
appreciated that there is likely to be some flexibility in the permissable distance downstream 
at which the tet operator sequence(s) can be linked. 

The term n tet operator sequence" is intended to encompass all classes of tet operators 
(e.g., A, B, C, D and E). A nucleotide sequence to be transcribed can be operatively linked to 
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sequence(s) by homologous recombination between a fe/O-containing recombination vector 
and sequences of the endogeneous gene. For example, a homologous recombination vector 
can be prepared which includes at least one tet operator sequence and a miminal promoter 
sequence flanked at its 3' end by sequences representing the coding region of the endogenous 
S gene and flanked at its 5* end by sequences from the upstream region of the endogenous gene 
by excluding the actual promoter region of the endogenous gene. The flanking sequences are 
of sufficient length for successful homologous recombination of the vector DNA with the 
endogenous gene. Preferably, several kilobases of flanking DNA are included in the 
homologous recombination vector. Upon homologous recombination between the vector 

1 0 DNA and the endogenous gene in a host cell, a region of the endogenous promoter is replaced 
by the vector DNA containing one or more tet operator sequences operably linked to a 
minimal promoter. Thus, expression of the endogenous gene is no longer under the control 
of its endogenous promoter but rather is placed under the control of the tet operator 
sequence^) and the minimal promoter. 

IS In another embodiment, tet operator sequences can be inserted elsewhere within an 

endogenous gene, preferably within a 5'or V regulatory region, via homologous 
recombination to create an endogenous gene whose expression can be regulated by a 
tetracycline-regulated fusion protein described herein. For example, one or more tetO 
sequences can be inserted into a promoter or enhancer region of an endogenous gene such 

20 that promoter or enhancer function is maintained (i.e., the tetO sequences are introduced into 
a site of the promoter/enhancer region that is not critical for promoter/enhancer function). 
Regions within promoters or enhancers which can be altered without loss of 
promoter/enhancer function are known in the art for many genes or can be determined by 
standard techniques for analyzing critical regulatory regions. An endogenous gene having 

25 tetO sequences inserted into a non-critical regulatory region will retain the ability to be 
expressed in its normal constitutive and/or tissue-specific manner but, additionally, can be 
downregulated by a tetracycline-controlled transcriptional inhibitor protein in a controlled 
manner. For example, constitutive expression of such a modified endogenous gene can be 
inhibited by in the presence of tetracycline (or analogue) using an inhibitor fusion protein that 

30 binds to tetO sequences in the presence of tetracycline (or analogue) (as described in further 
detail in Section IV and Section VI, Part B, below). 

A. Regulation of Expression of tet Operator-Linked Nucleotide Sequences 

Expression of a tet operator-linked nucleotide sequences is regulated by a 
35 transactivator fusion protein of the invention. Thus, the fusion protein and the target nucleic 
acid are both present in a host cell or organism. The presence of both the transactivator 
fusion protein and the target transcription unit in the same host cell or organism can be 
achieved in a number of different ways. For example, a host cell can be transfected with one 
nucleic acid of the expression system (e.g., encoding the transactivator fusion protein), stably 
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sequence(s) regulates the expression of two operatively linked nucleotide sequences that are 
transcribed in opposite directions from the common tet operator sequence(s). Accordingly, 
one nucleotide sequence is operatively linked to one side of the tet operator sequence (e.g., 
the 5' end on the top strand of DNA) and the other nucleotide sequence is operatively linked 

5 to the opposite side of the tet operator sequence (e.g., the 3 1 end on the top strand of DNA). 
Additionally, it should be understood that each nucleotide sequence to be transcribed includes 
an operatively linked minimal promoter sequence which is located between the nucleotide 
sequence to be transcribed and the tet operator sequences). 

A representative example of such a transcription unit is diagrammed schematically in 

10 Figure 6. In this vectors, the two nucleotide sequences, operatively linked to the same tet 
operator sequence(s), are transcribed in opposite directions relative to the tet operator 
sequence(s) (i.e., the sequences are transcribed in a divergent manner upon activation by a 
transactivator fusion protein of the invention). By "transcribed in opposite directions relative 
to the tet operator sequence(s) w , it is meant that the first nucleotide sequence is transcribed 5' 

15 to V from one strand of the DNA (e.g., the bottom strand) and the second nucleotide 
sequence is transcribed 5' to 3' from the other stand of the DNA (e.g., the top strand), 
resulting in bidirectional transcription away from the tet operator sequences). 

Accordingly, the invention provides a recombinant vector for coordinately-regulated, 
bidirectional transcription of two nucleotide sequence. In one embodiment, the vector 

20 comprises a nucleotide sequence linked by phosphodiester bonds comprising, in a 5' to 3* 
direction: 

• a first nucleotide sequence to be transcribed, operatively linked to 

• at least one tet operator sequence, operatively linked to 
25 • a second nucleotide sequence to be transcribed, 



wherein transcription of the first and second nucleotide sequences proceeds in opposite 
directions from the at least one tet operator sequence(s) (i.e., the first and second nucleotide 
sequences are transcribed in a divergent manner). 
30 In another embodiment, the vector does not include the first and second nucleotide 

sequence to be transcribed but instead contains cloning sites which allow for the introduction 
into the vector of nucleotide sequences of interest Accordingly, in this embodiment, the 
vector comprises a nucleotide sequence comprising in a 5 1 to 3* direction: 

35 • a first cloning site for introduction of a first nucleotide sequence to be 

transcribed, operatively linked to 

• at least one tet operator sequence, operatively linked to 

• a second cloning site for introduction of a second nucleotide sequence to be 
transcribed, 
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marker, such as a surface marker or enzyme (e.g., p-galactosidase or luciferase) which is used 
for selection of cells expressing the gene of interest 

Transcription of the two coordinatety-regulated nucleotide sequences can be induced 
by tetracycline (or an analogue thereof) by use of the Tc-inducible transcriptional activator of 

5 the invention to regulate expression of the two nucleotide sequences. Thus, in this system, 
expression of both nucleotide sequences is "off 1 in the absence of Tc (or analogue), whereas 
expression is turned "on" by the presence of Tc (or analogue). Alternatively, the vector for 
coordinate regulation of two nucleotide sequences can be used in conjunction with other 
tetracycline-regulated transcription factors known in the art For example, a transactivator 

10 fusion protein of a wild-type Tet repressor fused to a transcriptional activation domain, which 
activates gene expression in the absence of Tc (or analogue), such as the tTA described in 
Gossen,M.andBujard,H.(1992)iVoc.^a//.i4cad: ScL USA 22:5547-5551, can also be used 
in conjuction with this target transcription unit for coordinate regulation. 

15 C Independent Regulation of Expression of Multiple Nucleotide Sequences 

The invention still further permits independent and opposite regulation of two or 
more nucleotide sequences to be transcribed. Accordingly, another aspect of the invention 
pertains to a novel tef-regulated transcription unit for independent regulation of two or more 
genes. To independently regulate the expression of two nucleotide sequences to be 

20 transcribed, one nucleotide sequence is operatively linked to a tet operator sequenced) of one 
class type and the other nucleotide sequence is operatively linked to a tet operator 
sequenced) of another class type. Accordingly, the invention provides at least one 
recombinant vector for independent regulation of transcription of two nucleotide sequences. 
In one embodiment, the vectors) comprises: 



25 



30 



• a first nucleotide sequence to be a transcribed operatively linked to at least one 
tet operator sequence of a first class type; and 

• a second nucleotide sequence to be a transcribed operatively linked to at least 
one tet operator sequence of a second class type. 



(It should be understood that each nucleotide sequence to be transcribed also includes an 
operatively linked, upstream minimal promoter sequence.) The two independently regulated 
transcription units can be included on a single vector, or alternatively, on two seperate 
vectors. The recombinant vectors) containing the nucleotide sequences to be transcribed can 
35 be introduced into a host cell or animal as described previously. 

In another embodiment, the vectors) does not include the first and second nucleotide 
sequence to be transcribed but instead contains cloning sites which allow for the introduction 
into the vector of nucleotide sequences of interest Accordingly, in this embodiment, the 
vectors) comprises: 
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For example, the first nucleotide sequence to be transcribed may be linked to a class 
A tet operator and the first fusion protein may bind to class A operators, whereas the second 
nucleotide sequence to be transcribed may be linked to a class B tet operator and the second 
fusion protein may bind to class B operators. Thus, in this embodiment, transcription of the 
5 first nucleotide sequence is activated in the presence of Tc (or analogue thereof) while 

transcription of the second nucleotide sequence is activated in the absence of Tc (or analogue 
thereof). Alternatively, in another embodiment, the first fusion protein binds to class B 
operators and the second fusion protein binds to class A operators. In this case, transcription 
of the second nucleotide sequence is activated in the presence of Tc (or analogue thereof) 

1 0 while transcription of the first nucleotide sequence is activated in the absence of Tc (or 

analogue thereof). Appropriate transactivator proteins for use in this system can be designed 
as described above in Section I and in Gossen and Bujard (1992) cited supra. In order to 
inhibit heterodimerization between the two different types of Tet repressor fusion proteins 
present in the same cell, it may be necessary to mutate the dimerization region of one or both 

15 of the transactivator fusion proteins. Mutations can be targeted to the C-terminal region of 
TetR known to be involved in dimerization. The dimerization region has been described in 
detail based upon the crystal structure of TetR (see Hinrichs, W. et al. (1994) Science 
264:418-420). 

This system allows for independent and opposite regulation of the expression of two 

20 genes by Tc and analogues thereof. Use of different Tc analogues as inducing agents may 
further allow for high, low or intermediate levels of expression of the different sequences 
(discussed in greater detail in Section V below). The novel transcription unit of the invention 
for independently regulating the expression of two genes, described above, can be used in 
situations where two gene products are to be expressed in the same cell but where it is 

25 desirable to express one gene product while expression of the other gene product is turned 
"off 1 , and vice versa. For example, this system is particularly useful for expressing in the 
same host cell either a therapeutic gene or a suicide gene (i.e., a gene which encodes a 
product that can be used to destroy the cell, such as ricin or herpes simplex virus thymidine 
kinase). In many gene therapy situations, it is desirable to be able to express a gene for 

30 therapeutic purposes in a host cell but also to have the capacity to destroy the host cell once 
the therapy is completed. This can be accomplished using the above-described system by 
linking the therapeutic gene to one class of tet operator and the suicide gene to another class 
of tet operator. Thus, expression of the therapeutic gene in a host cell can be stimulated by 
Tc (in which case expression of the suicide gene is absent). Then, once the therapy is 

35 complete, Tc is removed, which turns off expression of the therapeutic gene and turns on 
expression of the suicide gene in the cell. 
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basal, constitutive expression of the gene may vary depending upon the type of cell or tissue 
in which the gene is introduced and on the site of integration of the gene. Alternatively, 
constitutive expression of endogenous genes into which tetO sequences have been introduced 
may vary depending upon the strength of additional endogenous regulatory sequences in the 
5 vicinity. The inhibitor fusion proteins described herein provide compositions that can be 
used to inhibit the expression of such tefO-linked genes in a controlled manner. 

In one embodiment, the inhibitor fusion protein of the invention comprises a first 
polypeptide that binds to tet operator sequences in the absence, but not the presence, of 
tetracycline (Tc) or an analogue thereof operatively linked to a heterologous second 

1 0 polypeptide that inhibits transcription in eukaryotic cells. In another embodiment, the 
inhibitor fusion protein comprises a first polypeptide that binds to tet operator sequences in 
the presence, but not the absence, of tetracycline operatively linked to a heterologous second 
polypeptide that inhibits transcription in eukaryotic cells. The term "heterologous" is 
intended to mean that the second polypeptide is derived from a different protein than the first 

15 polypeptide. Like the transactivator fusion proteins, the transcriptional inhibitor fusion 
proteins can be prepared using standard recombinant DNA techniques as described herein. 

A. The first polypeptide of the transcriptional inhibitor fusion protein 

The transcriptional inhibitor fusion protein of die invention is composed, in part, of a 

20 first polypeptide which binds to a tet operator sequence either (i) in the absence, but not the 
presence of tetracycline (Tc), or an analogue thereof, or alternatively, (ii) in the presence, but 
not the absence of Tc or an analogue thereof. 

Preferably, in the former embodiment, the first polypeptide is a wild-type Tet 
repressor (which binds to tet operator sequences in the absence but not the presence of Tc). 

25 A wild-type Tet repressor of any class (e.g., A, B, C, D or E) may be used as the first 

polypeptide. Preferably, the wild-type Tet repressor is a TnlO-derived Tet repressor. The 
nucleotide and amino acid sequences of a wild-type TnlO-derived Tet repressor are shown in 
SEQ ID NO: 16 and SEQ ID NO: 17, respectively. 

Alternatively, in the latter embodiment, the first polypeptide is a mutated Tet 

30 repressor as described in Section I, part A above (which binds to tet operator sequences in the 
presence but not the absence of Tc), A mutated Tet repressor of any class (e.g., A, B, C, D or 
E) may be used as the first polypeptide: Preferably, the mutated Tet repressor is a Tnl 0- 
derived Tet repressor having one or more amino acid substitutions at positions 71, 95, 101 
and/or 102. The nucleotide and amino acid sequences of such a mutated TnlO-derived Tet 

35 repressor are shown in SEQ ID NO: 1 8 and SEQ ID NO: 1 9, respectively. 



B. The second polypeptide of the transcriptional inhibitor fusion protein 

The first polypeptide of the transcriptional inhibitor fusion protein is operatively 
linked to a second polypeptide which directly or indirectly inhibits transcription in eukaryotic 
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limiting examples of other proteins which have transcriptional repressor activity in eukaryotic 
cells include the Drosophila homeodomain protein even-skipped (eve), the S. cerevisiae 
Ssn6/Tupl protein complex (see Herschbach and Johnson, supra), the yeast SIR1 protein (see 
Chien, et al. (1993) Cell 25:531-541), NePl (see Kohne, et al. (1993) J. Mol Biol. 232:747- 
5 755), the Drosophila dorsal protein (see Kirov, et al. (1994) Mol Cell Biol 14:713-722; 
Jiang, et al. (1993) EMBOJ. 12:3201-3209), TSF3 (see Chen, et al. (1993) Mol Cell Biol 
75:831-840), SF1 (see Targa, et al. (1992) Biochem Biophys. Res. Comm. 158:416-423), the 
Drosophila hunchback protein (see Zhang, et al. (1992) Proc. Nail Acad Set USA 22:751 1- 
7515), the Drosophila knirps protein (see Gerwin, et al. (1994) Mol Cell BioL 14:7899- 

1 0 7908), the WT1 protein (Wilm's tumor gene product) (see Anant, et al. (1 994) Oncogene 
2:31 13-3126; Madden et al., (1993) Oncogene fi:l 713-1720), Oct-2.1 (see Lillycrop, et al. 
(1994) Mol Cell Biol 14:7633-7642), the Drosophila engrailed protein (see Badiani, et al. 
(1994) Genes Dev. &770-782; Han and Manley, (1993) EMBOJ. 12:2723-2733), E4BP4 (see 
Cowell and Hurst, (1994) Nucleic Acids Res. 22:59-65) and ZF5 (see Numoto, et al. (1993) 

15 Nucleic Acids Res. 21:3767-3775), 

In a preferred embodiment, the second polypeptide of the transcriptional inhibitor 
fusion protein of the invention is a transcriptional silencer domain of the Drosophila 
Krueppel protein. A C-terminal region having repressor activity can be used, such as amino 
acids 403-466 of the native protein (see Sauer, F. and Jackie, R, supra). This region is 

20 referred to as C64KR. The nucleotide and amino acid sequences of C64KR are shown in 
SEQ ID NO: 20 and SEQ ID NO: 21, respectively. Construction of an expression vector 
encoding a TetR-C64KR fusion protein is described in Example 4. Alternatively, an alanine- 
rich amino terminal region of Kr that also has repressor activity can be used as the second 
polypeptide of the fusion protein. For example, amino acids 26-1 10 of Kr (see Licht, J.D. et 

25 al., (1990) supra) can be used as the second polypeptide. Alternatively, shorter or longer 
polypeptide fragments encompassing either of the Kr silencer domains that still retain full or 
partial inhibitor activity are also contemplated (e.g., amino acids 62 to 92 of the N-terminal 
silencer domain; see Licht, et al. (1994) supra). 

In another preferred embodiment, the second polypeptide of the transcriptional 

30 inhibitor fusion protein of the invention is a transcriptional silencer domain of the v-erbA 
oncogene product The silencer domain of v-erbA has been mapped to approximately amino 
acid residues 362-632 of the native v-erbA oncogene product (see Baniahmad, et al. supra). 
Accordingly, a fragment encompassing this region is used as the second polypeptide of the 
silencer domain. In one embodiment, amino acid residues 364-635 of the native v-erbA 

35 protein are used. The nucleotide and amino acid sequences of this region of v-erbA are 
shown in SEQ ID NO: 22 and SEQ ID NO: 23, respectively. Constniction of an expression 
vector encoding a TetR-v-erbA fusion protein is described in Example 5. Alternatively, 
shorter or longer polypeptide fragments encompassing the v-erbA silencer region that still 
retain full or partial inhibitor activity are also contemplated. For example, a.a. residues 346- 
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D. Expression of the transcriptional inhibitor fitsion protein 

A nucleic acid molecule encoding a transcriptional inhibitor fusion protein of the 
invention can be incorporated into a recombinant expression vector and introduced into a host 
cell to express the fusion protein in the host cell as described in Section II, Parts A, B and C, 
5 above. Preferably, a host cell expressing a transcriptional inhibitor fusion protein of the 
invention also carries a tet operator-linked gene of interest (i.e., target nucleotide sequence to 
be transcribed). 

Transgenic organisms expressing a transcriptional inhibitor fusion protein in cells 
thereof can be prepared as described in Section n, Part D, above. Moreover, homologous 

1 0 recombinant organisms expressing a transcriptional inhibitor fusion protein in cells thereof 
are also encompassed by the invention and can be prepared as described in Section II, Part E, 
above. The invention provides recombinant expression vectors suitable for homologous 
recombination. In one embodiment, such an expression vector comprises a nucleic acid 
molecule encoding a transcriptional inhibitor fusion protein of the invention which is flanked 

IS at its 5' and 3' ends by additional nucleic acid of a eukaryotic gene, the additional nucleic acid 
being of sufficient length for successful homologous recombination with the eukaryotic gene. 
Vectors and methods for creating homologous recombinant organisms that express the 
components of the regulatory system of the invention, and uses therefor, are described in 
further detail in U.S. Patent Application Serial Number 08/260,452. Preferably, a transgenic 

20 or homologous recombinant organism of the invention expressing a transcriptional inhibitor 
fusion protein in cells thereof also carries a tet operator-linked gene of interest (i.e., target 
nucleotide sequence to be transcribed) in cells thereof. 

V. Kits of the Invention 

25 Another aspect of the invention pertains to kits which include the components of the 

inducible regulatory system of the invention. Such a kit can be used to regulate the 
expression of a gene of interest (i.e., a nucleotide sequence of interest to be transcribed) 
which can be cloned into a target transcription unit The kit may include nucleic acid 
encoding a transcriptional activator fusion protein or a transcriptional inhibitor fusion protein 

30 or both. Alternatively, eukaryotic cells which have nucleic acid encoding a transactivator 
and/or inhibitor fusion protein stably incorporated therein, such that the transactivator and/or 
inhibitor fusion protein are expressed in the eukaryotic cell, may be provided in the kit 
In one embodiment, the kit includes a carrier means having in close confinement 
therein at least two container means: a first container means which contains a first nucleic 

35 acid (e.g., DNA) encoding a transactivator fusion protein of the invention (e.g., a recombinant 
expression vector encoding a first polypeptide which binds to a tet operator sequence in the 
presence of tetracycline operatively linked to a second polypeptide which activates 
transcription in eukaryotic cells), and a second container means which contains a second 
target nucleic acid (e.g., DNA) for the transactivator into which a nucleotide sequence of 
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of the invention. The target transcription units can be in a form which allows for introduction 
into the transcription units of nucleotide sequences of interest to be transcribed. Thus, in 
another embodiment, a kit of the invention includes 1) a first nucleic acid encoding a 
transact! vator which binds to a tet operator of a first class type in the presence of Tc or an 
5 analogue thereof, 2) a second nucleic acid comprising a first cloning site for introduction of a 
first nucleotide sequence to be transcribed operatively linked to at least one tet operator of a 
first class type, 3) a third nucleic acid encoding a transactivator which binds to a tet operator 
of a second class type in the absence of Tc or an analogue thereof, , and 4) a fourth nucleic 
acid comprising a second cloning site for introduction of a second nucleotide sequence to be 

1 0 transcribed operatively linked to at least one tet operator of a second class type. (Optionally, 
minimal promoter sequences are included in the second and fourth nucleic acids). In another 
embodiment, one nucleotide sequence to be transcribed (e.g., encoding a suicide gene) is 
already contained in either the second or the fourth nucleic acid. In yet another embodiment, 
the nucleic acids encoding the transactivators (e.g., the first and third nucleic acids described 

IS above) can be stably introduced into a eukaiyotic cell line which is provided in the kit. 

In yet another embodiment, a kit of the invention includes a first container means 
containing a first nucleic acid encoding a transcriptional inhibitor fusion protein of the 
invention (e.g., the fusion protein inhibits transcription in eukaryotic cells either only in the 
presence of Tc or only the absence of Tc) and a second container means containing a second 

20 nucleic acid comprising a cloning site for introduction of a nucleotide sequence to be 
transcribed operatively linked to at least one tet operator sequence. The kit may further 
include a third nucleic acid encoding a transactivator fusion protein that binds to tetO 
sequences either only in the presence of Tc or only in the absence of Tc. Alternatively, the 
first and/or third nucleic acids (Le., encoding the inhibitor or transactivator fusion proteins) 

25 may be stably incorporated into a eukaryotic host cell which is provided in the kit 
In still another embodiment, a kit of the invention may include at least one 
tetracycline or tetracycline analogue. For example, the kit may include a container means 
which contains tetracycline, anhydrotetracycline, doxycycline, epioxytetracycline or other 
tetracycline analogue described herein. 

30 

VI. Regulation of Gene Expression bv Tetracycline or Analogues Thereof 

A. Stimulation of Gene Expression by Transactivator Fusion Proteins 

In a host cell which carries nucleic acid encoding a transactivator fusion protein of the 
35 invention and a nucleotide sequence operatively linked to the tet operator sequence(i.e., gene 
of interest to be transcribed), high level transcription of the nucleotide sequence operatively 
linked to the tet operator sequences) does not occur in the absence of the inducing agent, 
tetracycline or analogues thereof. The level of basal transcription of the nucleotide sequence 
may vary depending upon the host cell and site of integration of the sequence, but is 
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include oral administration (e.g., dissolving the inducing agent in the drinking water), slow 
release pellets and implantation of a diffusion pump. To administer Tc or a Tc analogue to a 
transgenic plant, the inducing agent can be dissolved in water administered to the plant. 

The ability to use different Tc analogues as inducing agents in this system allows for 
5 modulate the level of expression of a tet operator-linked nucleotide sequence. As 

demonstrated in Example 2, anhydrotetracycline and doxycycline have been found to be 
strong inducing agents. The increase in transcription of the target sequence is typically as 
high as 1000- to 2000-fold, and induction factors as high as 20,000 fold can be achieved. 
Tetracycline, chlorotetracycline and oxytetracycline have been found to be weaker inducing 

1 0 agents, i.e., the increase in transcription of a target sequence is in the range of about 1 0-fold. 
Thus, an appropriate tetracycline analogue is chosen as an inducing agent based upon the 
desired level of induction of gene expression. It is also possible to change the level of gene 
expression in a host cell or animal over time by changing the Tc analogue used as the 
inducing agent. For example, there may be situations where it is desirable to have a strong 

1 5 burst of gene expression initially and then have a sustained lower level of gene expression. 
Accordingly, an analogue which stimulates a high levels of transcription can be used initially 
as the inducing agent and then the inducing agent can be switched to an analogue which 
stimulates a lower level of transcription. Moreover, when regulating the expression of 
multiple nucleotide sequences (e.g., when one sequence is regulated by a one of class tet 

20 operator sequence(s) and the other is regulated by another class of tet operator sequence(s), as 
described above in Section III, Part C, above), it may be possible to independently vary the 
level of expression of each sequence depending upon which transactivator fusion protein is 
used to regulate transcription and which Tc analogue(s) is used as the inducing agent 
Different transactivator fusion proteins are likely to exhibit different levels of responsiveness 

25 to Tc analogues. The level of induction of gene expression by a particular combination of 
transactivator fusion protein and inducing agent (Tc or Tc analogue) can be determined by 
techniques described herein, (e.g., see Example 2). Additionally, die level of gene expression 
can be modulated by varying the concentration of the inducing agent. Thus, the expression 
system of the invention provides a mechanism not only for turning gene expression on or off, 

30 but also for "fine tuning" the level of gene expression at intermediate levels depending upon 
the type and concentration of inducing agent used. 

fl. Inhibition of Gene Expression by Transcriptional Inhibitor Fusion Proteins 

The invention also provides methods for inhibiting gene expression using the 
35 transcriptional inhibitor fusion proteins of the invention. These methods can be used to 
down-regulate basal, constitutive or tissue-specific transcription of a fe/O-iinked gene of 
interest. For example, a gene of interest that is operatively linked to tetO sequences and 
additional positive regulatory elements (e.g., consitutive or tissue-specific enhancer 
sequences) will be transcribed in host cells at a level that is primarily determined by the 
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Section I above) and in USSN 08/270,637 and USSN 08/275,876. Transcriptional inhibitor 
fusion proteins are described herein in Section IV. 

As described above in Section HI, Part C, when more than one TetR fusion protein is 
expressed in a host cell or organism, additional steps may be taken to inhibit 
5 heterodimerization between the different TetR fusion proteins. For example, a transactivator 
composed of a TetR of one class may be used in combination with a transcriptional inhibitor 
composed of a TetR of a second, different class that does not heterodimerize with the first 
class of TetR. Alternatively, amino acid residues of the TetR involved in dimerization may 
be mutated to inhibit heterodimerization. However, even if some heterodimerization between 
1 0 transactivator and inhibitor fusion proteins occurs in a host cell, sufficient amounts of 
homodimers should be produced to allow for efficient positive and negative regulation as 
described herein. 

It will be appreciated by those skilled in the art that various combinations of activator 
and inhibitor proteins can be used to regulate a single fe/O-linked gene of interest in both a 

1 5 positive and negative manner or to regulate multiple te/O-linked genes of interest in a 

coordinated manner or in an independent manner using the teachings described herein. The 
precise regulatory components utilized will depend upon the genes to be regulated and the 
type of regulation desired. Several non-limiting examples of how the transactivator and 
inhibitor fusion proteins may be used in combination are described further below. However, 

20 many other possible combinations will be evident to the skilled artisan in view of the 
teachings herein and are intended to be encompassed by the invention. 

In a preferred embodiment, illustrated schematically in Figure 10, expression of a 
fefO-linked target gene of interest in a host cell is regulated in both a negative and positive 
manner by the combination of an inhibitor fusion protein that binds to tetO in the absence, 

25 but not the presence, of tetracycline or analogue thereof (referred to as a tetracycline 

controlled silencing domain, or tSD) and an activator fusion protein that binds to tetO in the 
presence, but not the absence, of tetracycline or analogue thereof (referred to as a reverse 
tetracycline controlled transactivator, or rtTA). In addition to tetO sequences, the target gene 
is linked to a promoter, and may contain other positive regulatory elements (e.g., enhancer 

30 sequences) that contribute to basal level, constitutive transcription of the gene in the host cell. 
Binding of tSD to the tetO sequences in the absence of tetracycline or analogue (e.g., 
doxycycline) inhibits the basal constitutive transcription of the gene of interest, thus keeping 
the gene of interest in a repressed state until gene expression is desired. When gene 
expression is desired, the concentration of tetracycline or analogue (e.g., doxycycline) in 

35 contact with the host cell increased. Upon addition of the drug, tSD loses the ability to bind 
to tetO sequences whereas the previously unbound rtTA acquires the ability to bind to tetO 
sequences. The resultant binding of rtTA to the tetO sequences linked to the gene of interest 
thus stimulates transcription of the gene of interest The level of expression may be 
controlled by the concentration of tetracycline or analogue, the type of Tc analogue used, the 
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VTT App lications of the Invention 

The invention is widely applicable to a variety of situations where it is desirable to be 
able to turn gene expression on and off, or regulate the level of gene expression, in a rapid, 

5 efficient and controlled manner without causing pleiotropic effects or cytotoxicity. Thus, the 
system of the invention has widespread applicability to the study of cellular development and 
differentiation in eukaryotic cells, plants and animals. For example, expression of oncogenes 
can be regulated in a controlled manner in cells to study their function. Additionally, the 
system can be used to regulate the expression of site-specific recombinases, such as CRE or 

1 0 FLP, to thereby allow for irreversible modification of the genotype of a transgenic organism 
under controlled conditions at a particular stage of development. For example, drug 
resistance markers inserted into the genome of transgenic plants that allow for selection of a 
particular transgenic plant could be irreversibly removed via a Tc-regulated site specific 
recombinase. Other applications of the regulatory system of the invention include: 

15 

A. Gene Therapy 

The invention may be particularly useful for gene therapy purposes, in treatments for 
either genetic or acquired diseases. The general approach of gene therapy involves the 
introduction of nucleic acid into cells such that one or more gene products encoded by the 

20 introduced genetic material are produced in the cells to restore or enhance a functional 
activity. For reviews on gene therapy approaches see Anderson, W.F. (1 992) Science 
256:808-813; Miller, AD. (1992) Nature 357:455-460; Friedmann, T. (1989) Science 
244:1275-1281; and Cournoyer, D., et al. (1990) Curr. Opin Biotech. 1:196-208. However, 
current gene therapy vectors typically utilize constitutive regulatory elements winch are 

25 responsive to endogenous transcriptions factors. These vector systems do not allow for the 
ability to modulate the level of gene expression in a subject In contrast, the inducible 
regulatory system of the invention provides this ability. 

To use the system of the invention for gene therapy purposes, in one embodiment, 
cells of a subject in need of gene therapy are modified to contain 1) nucleic acid encoding a 

30 transactivator fusion protein of die invention in a form suitable for expression of the 
transactivator in the host cells and 2) a gene of interest (e.g., for therapeutic purposes) 
operatively linked to a tet operator sequences). The cells of the subject can be modified ex 
vivo and then introduced into the subj ect or the cells can be directly modified in vivo 
(methods for modification of the cells are described above in Section II). Expression of the 

35 gene of interest in the cells of the subject is then stimulated by administering Tc or a To 
analogue to the patient. The level of gene expression can be varied depending upon which 
particular Tc analogue is used as the inducing agent The level of gene expression can also 
be modulated by adjusting the dose of the tetracycline, or analogue thereof, administered to 
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Proc. Natl. Acad ScL USA 29:10892-10895; van Beusechem, V.W. et al. (1992) Proc. Natl 
Acad Set S&7640-7644; Ros^ 

al. (1992) Human Gene Therapy 2:641-647; Cristiano, RJ. et al. (1993) Proc. Natl Acad 
ScL USA 20:2122-2126; Hwu, P. et al. (1993) J. Immunol 15Q:41 04-41 15; and Herz, J. and 
5 Gerard, R.D. (1993) Proc. Natl Acad ScL USA 2Q281 2-2816. 

Gene therapy applications of particular interest in cancer treatment include 
overexpression of a cytokine gene (e.g., TNF-a) in tumor infiltrating lymphocytes or ectopic 
expression of cytokines in tumor cells to induce an anti-tumor immune response at the tumor 
site), expression of an enzyme in tumor cells which can convert a non-toxic agent into a toxic 

1 0 agent, expression of tumor specific antigens to induce an anti-tumor immune response, 
expression of tumor suppressor genes (e.g., p53 or Rb) in tumor cells, expression of a 
multidrug resistance gene (e.g., MDR1 and/or MRP) in bone marrow cells to protect them 
from the toxicity of chemotherapy. 

Gene therapy applications of particular interest in treatment of viral diseases include 

1 5 expression of trans-dominant negative viral transactivation proteins, such as trans-dominant 
negative tat and rev mutants for HTV or trans-dominant ICp4 mutants for HSV (see e.g., 
Balboni, P.G. et al. (1993) 1 Med Virol 41:289-295; Liem, S J. et al. (1993) Hum. Gene 
Ther. 4:625-634; Malim, M.H. et al. (1992) J. Exp. Med 126:1197-1201; Daly, T.J. et al. 
(1993) Biochemistry 22:8945-8954; and Smith, C.A. et al. (1992) Virology 121:58 1-588), 

20 expression of trans-dominant negative envelope proteins, such as env mutants for HIV (see 
e.g., Steffy, K.R. et al. (1993) J. Virol 62:1854-1859), intracellular expression of antibodies, 
or fragments thereof, directed to viral products ("internal immunization", see e.g., Marasco, 
W.A. et al. (1993) Proc. Natl. Acad ScL USA 2Q:7889-7893) and expression of soluble viral 
receptors, such as soluble CD4. Additionally, the system of the invention can be used to 

25 conditionally express a suicide gene in cells, thereby allowing for elimination of the cells 
after they have served an intended function. For example, cells used for vaccination can be 
eliminated in a subject after an immune response has been generated the subject by inducing 
expression of a suicide gene in the cells by- administering Tc or a Tc analogue to the subject. 
The Tc-controlled regulatory system of the invention has numerous advantages 

30 properties that it particularly suitable for application to gene therapy. For example, the 

system provides an "onVofF switch for gene expression that allows for regulated dosaging a 
gene product in a subject There are several situations in which it may be desirable to be able 
to provide a gene product at specific levels and/or times in a regulated manner, rather than 
simply expressing the gene product constitutively at a set level. For example, a gene of 

35 interest can be switched "on" at fixed intervals (e.g., daily, alternate days, weekly, etc.) to 
provide the most effective level of a gene product of interest at the most effective time. The 
level of gene product produced in a subject can be monitored by standard methods (e.g., 
direct monitoring using an immunological assay such as ELISA or R1A or indirectly by 
monitoring of a laboratory parameter dependent upon the function of the gene product of 
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itself is controlled by, for example, a tumor-specific promoter or a radiation-induced 
promoter. 

In another embodiment, the regulatory system of the invention is used to express 
angiogenesis inhibitors) from within a tumor via a transgene regulated by the system of the 
5 invention. Expression of angiogenesis inhibitors in this manner may be more efficient than 
systemic administration of the inhibitor and would avoid any deleterious side effects that 
might accompany systemic administration. In particular, restricting angiogenesis inhibitor 
expression to within tumors could be particularly useful in treating cancer in children still 
undergoing angiogenesis associated with normal cell growth. 

1 0 In another embodiment, high level regulated expression of cytokines may represent a 

method for focusing a patients own immune response on tumor cells. Tumor cells can be 
transduced to express chemoattractant and growth promoting cytokines important in 
increasing an individual's natural immune response. Because the highest concentrations of 
cytokines will be in the proximity of the tumor, the likelihood of eliciting an immunological 

1 5 response to tumor antigens is increased. A potential problem with this type of therapy is that 
those tumor cells producing the cytokines will also be targets of the immune response and 
therefor the source of the cytokines will be eliminated before eradication of all tumor cells 
can be certain. To combat this, expression of viral proteins known to mask infected cells 
from the immune system can be placed under regulation, along with the cytokine gene(s), in 

20 the same cells. One such protein is the El 9 protein from adenovirus (see e.g., Cox, Science 
247:71 5). This protein prevents transport of class I HLA antigens to the surface of the cell 
and hence prevents recognition and lysis of the cell by the host's cytotoxic T cells. 
Accordingly, regulated expression of E19 in tumor cells could shield cytokine producer cells 
from cytotoxic T cells during the onset of an immune response provoked by cytokine 

25 expression. After a sufficient period of time has elapsed to eradicate all tumor cells but those 
expressing E19, E19 expression can be turned off, causing these cells then to fall victim to 
the provoked anti-tumor immune response. 

Benign prostatic hypertrophy - Similar to the above, a suicide gene can be regulated 
by a regulator of the invention, wherein the regulator itself is controlled by, for example, a 

30 prostate-specific promoter. 

The ability to express a suicide gene (e.g., an apoptosis gene, TK gene, etc) in a 
controlled manner using the regulatory system of the invention adds to the general safety and 
usefulness of the system. For example, at the end of a desired therapy, expression of a 
suicide gene can be triggered to eliminate cells carrying the gene therapy vector, such as cells 

35 in a bioinert implant, cells that have disseminated beyond die intended original location, etc. 
Moreover, if a transplant becomes tumorous or has side effects, the cells can be rapidly 
eliminated by induction of the suicide gene. The use of more than one Tc-controlled 
H on H / H ofF switch in one cell allows for completely independent regulation of a suicide gene 
compared to regulation of a gene of therapeutic interest (as described in detail herein). 
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cells of a localized transplant under the skin, such as keratinocytes, while avoiding any 
possible side effects from systemic treatment). 

In certain gene therapy situations, it may be necessary or desirable to take steps to 
avoid or inhibit unwanted immune reactions in a subject receiving treatment To avoid a 
5 reaction against the cells expressing the therapeutic gene product, a subject's own cells are 
generally used, when possible, to express the therapeutic gene product, either by in vivo 
modification of the subject's cells or by obtaining cells from the subject, modifying them ex 
vivo and returning them to the subject In situations where allogeneic or xenogeneic cells are 
used to express a gene product of interest, the regulatory system of the invention, in addition 

1 0 to regulating a therapeutic gene, can also be used to regulate one or more genes involved in 
the immune recognition of the cells to inhibit an immune reaction against the foreign cells. 
For example, cell-surface molecules involved in recognition of a foreign cell by T 
lymphocytes can be downmodulated on the surface of a foreign cell used for delivery of a 
therapeutic gene product, such as by regulated expression in the foreign cell of a ribozyme 

1 5 which cleaves the mRNA encoding the cell-surface molecule. Particularly preferred cell 
surface molecules which can be downmodulated in this manner to inhibit an unwanted 
immune response include class I and/or class II major histocompatibility complex (MHC) 
molecules, costimulatory molecules (e.g., B7-1 and/or B7-2), CD40, and various "adhesion" 
molecules, such as ICAM-1 or ICAM-2. Using approaches described herein for independent 

20 but coordinate regulation of multiple genes in the same cell, the down-regulation of 
expression of a cell-surface molecule(s) in a host cell can be coordinated with the up- 
regulation of expression of a therapeutic gene. Accordingly, after therapy is completed and 
expression of the therapeutic gene is halted, expression of the endogenous cell surface 
molecule(s) can be restored to normal. 

25 Furthermore, as described above regarding anti-cancer treatments, a viral protein (e.g., 

adenovirus E19 protein) that downmodulates expression of MHC antigens can be regulated in 
host cells using the system of the invention as a means of avoiding unwanted immunological 
reactions. 

In addition to avoiding or inhibiting an immune response against a foreign cell 
30 delivering a therapeutic gene product, it may also be necessary, in certain situations, to avoid 
or inhibit an immune response against certain components of the regulatory system of the 
invention (e.g., the regulator fusion proteins described herein) that are expressed in a subject, 
since these fusion proteins contain non-mammalian polypeptides that may stimulate an 
unwanted immune reaction. In this regard, regulator fusion proteins can be designed and/or 
35 selected for a decreased ability to stimulate an immune response in a host For example, a 
transcriptional activator domain for use in the regulator fusion protein can be chosen which 
has minimal immunogenicity. In this regard, a wild-type transcriptional activation domain of 
the herpes simplex virus protein VP16 may not be a preferred transcriptional activation 
domain for use in vivo, since it may stimulate an immune response in mammals. Alternative 
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encoding a transactivator fusion protein of the invention and a nucleic acid encoding the 
protein of the interest operatively linked to at least one tet operator sequence, is grown at 
production scale in a culture medium in the presence of tetracycline or a tetracycline 
analogue to stimulate transcription of the nucleotides sequence encoding the protein of 
5 interest (i.e„ the nucleotide sequence operatively linked to the /c/ oporator sequrace(s)) and 
the protein of interest is isolated from harvested host cells or from the culture medium. 
Standard protein purification techniques can be used to isolate the protein of interest from the 
medium or from the harvested cells. 

10 C Production of Proteins in Vivo 

The invention also provides for large scale production of a protein of interest in 
animals, such as in transgenic farm animals. Advances in transgenic technology have made it 
possible to produce transgenic livestock, such as cattle, goats, pigs and sheep (reviewed in 
Wall, R.J. et al. (1992) I Cell Biochem 42:1 13-120; and Clark, A J. et al. (1987) Trends in 

1 5 Biotechnology 5:20-24). Accordingly, transgenic livestock carrying in their genome the 

components of the inducible regulatory system of the invention can be constructed, wherein a 
gene encoding a protein of interest is operatively linked to at least one tet operator sequence. 
Gene expression, and thus protein production, is induced by administering Tc (or analogue 
thereof) to the transgenic animal. Protein production can be targeted to a particular tissue by 

20 linking the nucleic acid encoding the transactivator fusion protein to an appropriate tissue- 
specific regulatory elements) which limits expression of the transactivator to certain cells. 
For example, a mammary gland-specific regulatory element, such as the milk whey promoter 
(U.S. Patent No. 4,8733 1 6 and European Application Publication No. 264,1 66), can be 
linked to the transactivator transgene to limit expression of the transactivator to mammary 

25 tissue. Thus, in the presence of Tc (or analogue), the protein of interest will be produced in 
the mammary tissue of the transgenic animal. The protein can be designed to be secreted into 
the milk of the transgenic animal, and if desired, the protein can then be isolated from the 
milk. 

30 D. Animal Models of Human Disease 

The transcriptional activator and inhibitor proteins of the invention can be used alone 
or in combination to stimulate or inhibit expression of specific genes in animals to mimic the 
pathophysiology of human disease to thereby create animal models of human disease. For 
example, in a host animal, a gene of interest thought to be involved in a disease can be placed 

35 under the transcriptional control of one or more tet operator sequences (e.g., by homologous 
recombination, as described herein). Such an animal can be mated to a second animal 
carrying one or more transgenes for a transactivator fusion protein and/or an inhibitor fusion 
protein to create progeny that carry both a tetracycline-regulated fusion protein(s) gene and a 
tet-regulated target sequence. Expression of the gene of interest in these progeny can be 
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inhibitor fusion proteins of the invention contain a eukaryotic transcriptional silencer domain, 
these fusion proteins should be more efficient at repressing transcription in eukaryotic cells, 
and thus may potentially require lower intracellular concentrations for efficient repression 
with less liklihood of "leakiness". Additionally, by insertion of tetO sequences into the 
5 regulatory region of an endogenous gene, the transcriptional inhibitor fusion proteins of the 
invention can be used to down-regulate constitutive and/or tissue-specific expression of 
endogenous genes. 

Furthermore, in contrast to various versions of the lac system (e.g., Labow et al. 
(1990) Mol Cell Biol JU:3343-3356; Bairn et al. (1991) Proc. Natl Acad ScL USA 

10 24:5072-5076), which are limited by the negative properties of the inducing agent (IPTG) 
and/or by the need to increase the temperature in order to induce gene expression (which may 
elicit pleiotropic effects), the inducing agent used in the system of the invention (Tc or an 
analogue thereof) has many advantageous properties: l)Tc and analogues thereof exhibit 
high affinity for TetR and low toxicity for eukaryotic cells, and thus can be used for gene 

1 5 induction at concentrations that do not affect cell growth or morphology; 2) Tc analogues 
which retain TetR binding but which have reduced antibiotic activity exist and can be used as 
inducing agents, thereby avoiding possible side effects from the antibiotic property of Tc; 3) 
the pharmacokinetic properties of Tc and Tc analogues enable rapid and efficient cellular 
uptake and penetration of physiological barriers, such as the placenta or the blood-brain 

20 barrier, and 4) Tc analogues with different induction capabilities permit modulation of the 
level of gene expression. 

Thus, the invention provides an inducible regulatory system which allows for rapid 
activation of gene transcription without cellular toxicity and a range of induction indices. 
The increase in gene expression upon induction typically is between 1 000- and 2000-fold and 

25 can be as high as about 20,000-fold. Alternatively, lower levels of gene induction, e.g., 1 0- 
fold, can be achieved depending upon which inducing agent is used. This system can be 
utilized in a wide range of applications. These applications include gene therapy, large-scale 
production of proteins in cultured cells or in transgenic farm animals, and the study of gene 
function, for example in relationship to cellular development and differentiation. Moreover, 

30 the novel transcription units of the invention allow for coordinate or independent regulation 
of the expression of multiple genes utilizing the regulatory components of the invention. 



35 



This invention is further illustrated by the following examples which should not be 
construed as limiting. The contents of all references, patents and published patent 
applications cited throughout this application are hereby incorporated by reference. 
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a? (position) 




affected codon 




wild-type 


mutant 


wild-type 


mutant 


glu(71) 


lys 


GAA 


AAA 


asp (95) 


asn 


GAT 


AAT 


leu (101) 


ser 


TTA 


TCA 


gly(102) 


asp 


GGT 


GAT 



Two additional mutations did not result in an amino acid exchange: 

5 

m (position) aflfat<rf codon 

wild-tvpe mutant wild-type mutant 

leu (41) leu TTG CTG 

arg (80) arg CGT CGC 

To convert the rTetR mutant to a transcriptional activator, a 399 base pair 
XbaI/Eco47IIl fragment encoding amino acids 3 to 135 of rTetR (i.e., encompassing the 

1 0 mutated region) was exchanged for the corresponding restriction fragment of the expression 
vector pUHD15-l to create pUHD17-l. In pUHDl 5- 1 , nucleotide sequences encoding wild- 
type TetR are linked in frame to nucleotide sequences encoding the C-terminal 130 amino 
acids of herpes simplex virus VP16. These transactivator sequences are flanked upstream by 
a CMV promoter/enhancer and downstream by an S V40 poly(A) site (the construction of 

15 pUHD15-l is described in more detail in USSN 08/076,726 and Gossen, M. and Bujard, H. 
(1992) Proc. NatL Acad Set ££5547-5551). Thus, in pUHDl 7-1, nucleotide 
sequences encoding the reverse TetR mutant are linked in frame to VP 1 6 sequences to create 
a reverse Tc-controlled transactivator (referred to herein as tTA R ). The analogous exchange 
of the mutated region of rTetR for the wild-type region of TetR was performed with plasmid 

20 pUHD152-l, which is the same as pUHDl 5-1 except that it additionally contains nucleotide 
sequences encoding a nuclear localization signal linked in-frame to the 5 ! end of the 
nucleotide sequences encoding the Tet repressor. The amino acid sequence of the nuclear 
localization signal is MPKRPRP (SEQ ID NO: 5), which is linked to the serine at amino acid 
position 2 of TetR. The resulting expression vector encoding the reverse Tc-controlled 

25 transactivator including a nuclear localization signal (referred to herein as ntTA R ) was named 
pUHD172-l. 
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SlabJeJiansfestiim f ^ Annna ivsis expression vectors were prepared for stable 
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for hygromycin resistance (see Bernhard, H-U. et al. (1985) Exp, Cell Res. 15&237-243), and 
hygromycin resistant clones were selected. In an analogous experiment, HR5 cells were 
cotransfected with pUH13-7 and pHMR272. pUH13-7 contains a minimal promoter 
sequence spanning position +19 to -37 of the HSVtk promoter adjacent to the heptameric 
5 tetO sequences, rather than a minimal CMV promoter. From 21 hygromycin resistant clones, 
10 showed inducible luciferase activity upon addition of Tc or doxy cy line (Dc) to the culture 
medium Clones containing the luciferase reporter gene linked to a minimal CMV promoter 
are referred to as HR5-C, whereas those containing the luciferase reporter gene linked to a 
minimal tk promoter are referred to HR5-T. 

10 Six of the HR5 clones stably transfected with a ntTA R -dependent reporter unit and 

previously shown to be responsive to tetracyclines were grown in parallel in the absence or 
presence of 1 pg/ml doxycycline. About 3 x 10 4 cells were plated in each 35 mm dish (4 
dishes for each clone). After growth for 60 hours, cells were harvested and the luciferase 
activity of the extracts (in relative light units (rlu)/jig extracted protein) was determined. As 

1 5 shown in Table 1 , the absolute expression levels of six clones demonstrate that activation of 
luciferase gene expression over 3 orders of magnitude is achieved in several of the double 
stable cell lines containing the ntTA R regulatory system. 

It should be noted that even higher induction factors (e.g., as high as a 20,000-fold 
increase in expression) could be achieved if, instead of simply adding the inducing agent to 

20 the culture medium, the cells were washed prior to induction and then replated in fresh 
culture medium containing the inducing agent 

TABLE 1 . Doxvcycline-de pendent luciferase activity of double stable hic+/HR5 cell clones 
Luciferase Activity, rlu/jig protein 



25 


Clone 


-Poxycycline 




Induction Factor 


HR5-C6 


65 


54,911 


845 






62 


69,525 . 


1120 




HR5-C11 


100 


165,671 


1660 


30 




142 


179,651 


1270 




HR5-C14 


43 


44,493 


1030 






43 


56,274 


1310 


35 


HR5-T2 


56 


16,696 


298 






40 


16,416 


410 




HR5-T15 


6.8 


1838 


270 






6.5 


1688 


260 


40 












HR5-T19 


4.8 


1135 


236 






5.4 


1285 


237 
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20 



25 



30 



35 
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these results indicate that induction of gene expression occurs rapidly following exposure of 
the cells to the inducing agent 

EXAMPLE 3: Coordinate Regulation of the Expression of Two Nucleotide 
. 5 Sequences by a Tc-Controlled Transcriptional Activator 

A recombinant expression vector for coordinate, bidirectional transcription of two 
nucleotide sequences was constructed comprising, in a 5' to 3* direction: a luciferase gene, a 
first minimal promoter, seven tet operator sequences, a second minimal promoter and a LacZ 

10 gene. The construct is illustrated in Figure 6. In this construct, the luciferase and LacZ genes 
are oriented such that they are transcribed in opposite orientations relative to the tet operator 
sequences, Le., the luciferase gene is transcribed in a 5' to 3* direction from the bottom strand 
of DNA, whereas the LacZ gene is transcribed in a 5' to 3' direction from the top strand of 
DNA. The luciferase gene is followed by an SV40 polyadenylation signal, whereas the LacZ 

1 5 gene is followed by a p-globin polyadenylation signal. 

The construct was transfected into the HeLa cell line HtTA-1 cells, which express a 
wild-type Tet repressor-VPl 6 fusion protein (referred to as tTA and described in Gossen, M. 
and Bujard,H. (1992) Proc. Natl Acad Set U£4 32:5547-5551). The tTA fusion protein 
binds to tet operator sequences in the absence of Tc (or analogue) but not in the presence of 

20 Tc (or analogue). The construct was cotransfected into HtTA- 1 cells with a plasmid which 
confers hygromycin resistance and stably transfected clones were selected based upon their 
hygromycin resistant phenotype. Selected hygromycin resistant (Hygr 1 ) clones were 
examined for luciferase and P-galactosidase activity. Clones positive for all three markers 
(Hygi* luc+ P-gal + ) were then examined for tetracycline-dependent coregulation of 

25 expression of luciferase and f}-galactosidase activity by cultureing the clones in increasing 
amounts of tetracycline and measuring luciferase and p-galactosidase activity. The results of 
such an experiment using clone Ht 13 16-8/50 are shown in Figure 8. In the absence of 
tetracycline (in which case tTA can bind to tet operators and activate gene expression), both 
luciferase and p-galactosidase activity is detected. In the presence of increasing amounts of 

30 tetracycline, luciferase and p-galactosidase activity are coordinately and equivalency 
downregulated. This data demonstrates that expression of two genes can be coordinately 
regulated by a tetracy cline-controlled transacti vator by operatively linking the two genes to 
die same tet operator sequence(s). 

35 EXAMPLE 4 : Construction of a Tetracycline-Regulated Transcriptional Inhibitor 
Fusion Protein Comprising TetR and a Krueppd Silencer Domain 

To contract an expression vector encoding a tetracycline-regulated transcriptional 
inhibitor of the invention (also referred to as a tetacycline controlled silencer domain, or 
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Gly-Ser corresponds to amino acids 205-207 of TetR, Pro-Gly are encoded by the poly linker 
and Asp-Met-Glu correspond to the amino acids 403-405 of C64KR. 

Similarly, an expression vector encoding a fusion protein of a mutated TetR (that 
binds to tetO only in the presence of Tc) and C64KR can be constructed as described above 
5 using nucleotide sequences encoding a mutant TetR (the nucleotide and amino acid 

sequences of which are shown in SEQ ID NOs: 1 8 and 19, respectively) in place of the wild 
type TetR sequences in pUHD141-smal. 

An expression vector encoding a TetR-Kr fusion protein (e.g., pUHD141kr-l) is 
transiently or stably transfected into host cells as described in Example 2 to express the TetR- 

10 Kr fusion protein in the host cell. A reporter gene construct containing one or more tetO 
sequences, a minimal promoter and a reporter gene, such as luciferase, is also transfected into 
the cells as described in Example 2. (Reporter gene constructs are described in further detail 
in USSN 08/076,726 and Gossen, M. and Bujard, H. (1992) Proc. Natl Acad Sci USA 
22:5547-555 1). Luciferase activity in the presence and absence of increasing concentrations 

15 of Tc or an analogue, e.g., doxycy cline, is measured as described in Example 2. For the wild 
type TetR-Kr fusion protein described above, the transcriptional inhibiting ability of the 
fusion protein is determined by comparing the amount of luciferase activity in the presence of 
doxycycline (no repression) to the amount of luciferase activity in the absence of doxycycline 
(repression). The transcriptional inhibiting activity of the fusion protein can also be tested 

20 using reporter gene constructs that exhibit higher basal levels of expression (i.e., higher levels 
of expression in the presence of doxycycline) by using a reporter gene construct that contains 
additional positive regulatory elements (e.g., enhancer sequences). 

EXAMPLE 5 : Construction of a Tetracycline-Regulated Transcriptional Inhibitor 
25 Fusion Protein Comprising TetR and a v-erbA Silencer Domain 

To construct an expression vector encoding a fusion protein between TetR and a 
transcriptional silencer domain from the v-erbA oncogene product, a nucleic acid fragment 
encoding a silencer domain from v-erbA is ligated in-frame into pUHD141sma-l as 

30 described in Example 4. A nucleic acid fragment encoding a v-erbA silencer domain suitable 
for ligation into pUHD141sma-l is amplified by the polymerase chain reaction (PCR) using a 
v-erbA cDNA as a template. Oligonucleotide primers are designed which amplify a nucleic 
acid fragment encoding amino acids 364-635 of the native v-erbA protein. The nucleotide 
and amino acid sequences of this region of v-erbA are shown in SEQ ID NO: 22 and SEQ ID 

35 NO: 23, respectively. As described in Example 4, PCR primers are designed such that the 
amplified v-erbA fragment contains restriction endonuclease sites at its 5* and 3' ends, such as 
PspAI at the 5* end and BamHI at the 3* end. After a standard PCR reaction, the amplified 
fragment and pUHD141-smal are digested with PspAI and BamHI. The amplified fragment 
is then ligated directipnally into the polylinker site of pUHD141-smal using standard ligation 
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of the human CMV IE promoter (position +75 to -675) together with a rabbit 0-globin 
polyadenylation site including an intron. The human CMV IE promoter is a constitutive 
promoter that allows expression of the mutated tetR-VP16 fusion protein in all cells where 
chromosomal integration of the DNA sequence encoding tTA R has occurred. DNAwas 
5 injected into fertilized oocytes at a concentration of approximately 5 ng per \xl by standard 
techniques. Transgenic mice were generated from the injected fertilized oocytes according to 
standard procedures. Transgenic founder mice were analyzed using polymerase chain 
reaction (PCR) and Southern hybridization to detect the presence of the tTA R transgene in 
chromosomal DNA of the mice. Two transgenic mouse lines, CR3 and CR4 were identified 
10 and crossbred with another transgenic mouse line carrying a luciferase reporter gene under 
the control of teiO sequences, (described further below). 

Generation of mice transgenic for the Phr^J. luciferase reporter unit 

Mice carrying a PhCMVM luc reporter gene expression unit were generated by 

1 5 pronuclear injection into fertilized oocytes of a 3.1 kb Xhol-Eael fragment excised from 
plasmid pUHC 13-3. This DNA-fragment contains the luciferase gene under transcriptional 
control of the tetracycline-responsive PhCMVM promoter (SEQ ID NO: 8), together with an 
SV40 1 early polyadenylation site including an intron. DNA was injected into oocytes at a 
concentration of approximately 5 ng per (d and transgenic mice were generated according to 

20 standard procedures. Transgenic founder mice were analyzed using Southern hybridization 
to detect the presence of the PhCMVM luc transgene in chromosomal DNA of the mice. A 
mouse line transgenic for the tefO-linked luciferase reporter gene, L7, was crossbred with the 
tTA R transgenic lines CR3 and CR4 (described further below). 

25 Generation of mice transgenic for the PhCMVM and PhCMvJlA* 

Having constructed single transgenic mice expressing tTA R or carrying PhCMVM 
luc, double transgenic mice carrying both the tTA expression vector and the luciferase 
reporter-units were obtained through cross breeding of heterozygous mice transgenic for one 
of the two transgenes. Double transgenic animals were identified by standard screenings 

30 (eg., PCR and/or Southern hybridization) to detect the presence of both the tTA R transgene 
and the PhCMVM luc transgene in chromosomal DNA of the mice. 

Induction and analysis of luciferase activity in tiss ue samples from mice 

For oral administration, tetracycline or its derivative doxycycline were given in the 
35 drinking water at a concentration of 200 ^ig per ml with 5 % sucrose to hide the bitter taste of 

the antibiotics. For lactatmg mice, the concentration was 2 mg per ml with 10 % sucrose to 

ensure a sufficient uptake via the milk by the young. 

To analyze luciferase activity, mice were killed by cervical dislocation and tissue 

samples were homogenized in 2 ml tubes containing 500 pJ lysis-buffer (25 mM Tris 
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SEQUENCB LISTING 



(1) GENERAL INFORMATION : 

<i) APPLICANTS: 

(A) NAME: Hermann Bujard 

(B) STREET: Remler Str. 9 

(C) CITY: D-69120 Heidelberg 
(E) COUNTRY: Germany 

(A) NAME: Manfred Gossen 

(B) STREET: 978 Arlington Boulevard #B 

(C) CITY: Bl Cerrito 

(D) STATE: California 

(E) COUNTRY: USA 

(P) POSTAL CODB (ZIP) : 94530 

(ii) TITLE OF INVENTION: Tetracycline -Regulates 

Modulators 

(iii) NUMBER OF SEQUENCES: 28 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: LAHXVB & COCKFIELD 

<B) STREET: 60 state Street, suite 510 

(C) CITY: Boston 

<D) STATE: Massachusetts 

(E) COUNTRY: USA 

(F) ZIP: 02109-1875 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 
<B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: ASCII Text 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: To be assigned 

(B) FILING DATE: 07-JUN-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/383,754 

(B) FILING DATE: 03-FKB-1995 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/275,876 

(B) FILING DATE: 15- JULY- 94 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA 

(A) APPLICATION NUMBER: US 08/270,637 

(B) FILING DATE: 01- JULY- 94 

(C) CLASSIFICATION: 



(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: DeConti, Giulio A. Jr. 

(B) REGISTRATION NUMBER: 31,503 
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(C) REFERENCE/DOCKET NUMBER: BBI-009C2PC 

(ix) TELECOMMUNICATION INFORMATION : 

(A) TELEPHONE: (617)227-7400 

(B) TELEFAX: (617)227-5941 



(2) 



INFORMATION FOR SEQ ID NO:l: 



1rt (i) SEQUENCE CHARACTERISTICS: 

1 (A) LENGTH: 100B base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

15 MOLECULE TYPE: DNA 



20 



25 



(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1..1008 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1. .100B 

(ix) FEATURE: w . „ 

(A) NAME/KEY: misc. binding 

(B) LOCATION: 1..207 



10 (ix) FEATURE: 

3U (A) name/key : misc. binding 

(B) LOCATION: 208.. 335 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
J ( B ) LOCATION: 1..1005 



(xi ) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

» ^ *™ era ATT AAC AGC GCA TTA GAG CTG 48 
ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC ^ ^ ^ 

40 Met Ser Arg Leu Asp Lys Ser Lys Val lie as ^ 

==S=SS25===«= = = 

45 20 

- ™- %n rra TAT TGG CAT GTA AAA AAT AAG 

5 Z Z S S 2 S S S S - - « * - - 

35 40 

50 ^ o ^ ATT GAG ATG TTA GAT AGO CAC CAT 

CGG GCT TTG CTC GAC GCC TTA GCC ATT ^ ^ ^ ^ ^ 

Arg Ala Leu Leu Asp Ala Leu Ala lie * gQ 



96 



144 



.50 55 
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ACT CAC TTT TGC CCT TTA AAA GGG 6AA AGC TGG CAA GAT TTT TTA CGC 
Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 60 



240 



AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC AAT GGA 
Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 
85 90 95 



288 



GCA AAA GTA CAT TCA GAT ACA CGG CCT ACA GAA AAA GAG TAT GAA ACT 
Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 



336 



CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 



384 



AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 



432 



GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 



480 



CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 



528 



TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 



576 



ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC GCG 
lie He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser Ala 
195 200 205 



624 



TAG AGC CGC GCG CGT ACG AAA AAC AAT TAC GGG TCT ACC ATC GAG GGC 
Tyr Ser Arg Ala Arg Thr Lys Asn Asn Tyr Gly Ser Thr He Glu Gly 
210 215 220 



672 



CTG CTC GAT CTC CCG GAC GAC GAC GCC CCC GAA GAG GCG GGG CTG GCG 
Leu Leu Asp Leu Pro Asp Asp Asp Ala Pro Glu Glu Ala Gly Leu Ala 
225 230 235 240 



720 



GCT CCG CGC CTG TCC TTT CTC CCC GCG GGA CAC ACG CGC AGA CTG TCG 
Ala Pro Arg Leu Ser Phe Leu Pro Ala Gly His Thr Arg Arg Leu Ser 
245 250 255 



768 



ACG GCC CCC CCG ACC GAT GTC AGC CTG GGG GAC GAG CTC CAC TTA GAC 
Thr Ala Pro Pro Thr Asp Val Ser Leu Gly Asp Glu Leu His Leu Asp 
260 265 270 



816 



GGC GAG GAC GTG GCG ATG GCG CAT GCC GAC GCG CTA GAC GAT TTC GAT 
Gly Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp 
275 280 285 



864 



CTG GAC ATG TTG GGG GAC GGG GAT TCC CCG GGT CCG GGA TTT ACC CCC 
Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro 
290 295 300 



912 
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SS===S5=====S=== " 

305 



5 „ nra MT GAC GAG TAC GOT GGG TAG 1008 



325 330 
10 • (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 335 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:2: 

*■ - - « » - s " M ' "5 1 *" 

20 ,1 Asn . -V » - « - ~ ~ « ~ "S *" "* 
20 

ly . u. W « - «.»-.-«■«» - « » - » 

35 40 
25 ^A.aLeuLenAspMaLeuAialleOluHetLeuAspAr.HisHis 

^ l Phe Cys Pro Leu Lys Gly «U Ser Trp Gin Asp Phe Len Arg 
30 «5 70 

^ Ma Leu Leu Ser His Arg Asn Gly 
Asn Lys Ala Lys Ser Phe Arg Cys Aia i* g5 

85 

. lre pro Thr Glu Lys Gin Tyr Glu Thr 

35 Ala Lys Val His ser Asp Thr Arg Pro Tnr ^ 

100 

t«, Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
Leu Glu Asn Gin Leu Ala Pne i*u v.* 125 

115 12 



40 



Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr u. GXy Cys 



130 135 



val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
45 145 • 150 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 A/ 

50 «.„,«. «>» W »• « - - " "* "* S ° l " "~ 

180 A ° 

„. ». „. «, » « «■ - - * <- - s 011 Mt 

195 

55 
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Tyr Ser Arg Ala Arg Thr Lys Asn Asn Tyr Gly Ser Thr lie Glu Gly 
210 215 220 

Leu Leu Asp Leu Pro Asp Asp Asp Ala Pro Glu Glu Ala Gly Leu Ala 
225 230 235 240 

Ala Pro Arg Leu Ser Phe Leu Pro Ala Gly His Thr Arg Arg Leu Ser 
245 250 255 

Thr Ala Pro Pro Thr Asp Val Ser Leu Gly Asp Glu Leu His Leu Asp 
260 265 270 

Gly Glu Asp Val Ala Met Ala His Ala Asp Ala Leu Asp Asp Phe Asp 
275 280 285 

Leu Asp Met Leu Gly Asp Gly Asp Ser Pro Gly Pro Gly Phe Thr Pro 
290 295 300 

His Asp Ser Ala Pro Tyr Gly Ala Leu Asp Met Ala Asp Phe Glu Phe 
305 310 315 320 

Glu Gin Met Phe Thr Asp Pro Leu Gly He Asp Glu Tyr Gly Gly 
325 330 335 



INFORMATION FOR SEQ ID NO:3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS: double 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: UNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



GAC GCG CTA GAC GAT TTC GAT CTG GAC ATG TTG 33 
Asp Ala Leu Asp Asp Phe Asp Leu Asp Met Leu 
15 10 



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



10 
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(xi) SEQUENCE DESCRIPTION: SEQIDNO:4: 

Mp ,1a Leu Asp Asp Pbe Asp X- »P »f ~ 
1 5 
1 {2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 



20 



(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 5: 

Met Pro Lys Arg Pro Arg Pro 
1 5 
, 2 ) INFORMATION FOR SEQ ID NO: 6: 

25 (i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 569 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

- ~~<r . linoar 



30 



JWJ = "™~ 

( D ) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 



35 ,«) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

GRATTCGGGG CCGCGGAGGC TGGATCGGTC CCGGTGTCTT CTATGGAGGT CM-— 
40 ,^0— «^C— 

40 TGGATGG ariTGAAAGTC GAGTTTACCA CTCCCTATCA 

GTTTACCACT CCCTATCAGT GATAGAGAAA AGTGAAAGT 

STGATAGAGA AAAGTGAAAG TCGAGTTTAC CACTCCCTAT CAGTGATAGA — 
45 AGTCGAGTTT ACCACTCCCT ACCAGTGATA GAGAAAAGTG AAAGTCGAGT TTACCACTCC 
CTATCAGTGA TAGAGAAAAG TGAAAGTCGA GTTTACCACT CCCTATCAGT G_ 
50 .TGAAAGTC GAGTTTACCA CTCCCTATCA G TGATAGAGA AAAG.GAAAG TCGAGC^ 
TACCCGGGTC GAGTAGGCGT GTACGGTGGG AGGCCTATAT AAGCAGAGCT CGTTTAGTGA 
„T CGCCTGGAGA CGCCATCCAC GCTGTTTTGA CCTCCATAGA AGACACCGGG 
55 ACCGATCCAG CCTCCGCGGC CCCGAATTC 
(2) INFORMATION FOR SEQ IDNO:7: 



60 

120 

180 

240 

300 

360 

420 

480 

540 

569 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 520 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

AGATCTGCAG GGTCGCTCGG TGTTCGAGGC CACACGCGTC ACCTTAATAT GCGAAGTGGA 60 

15 CCGGATCTCG AGTTTACCAC TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC 120 

ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG 180 

AGAAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT GAAAGTCGAG 240 

20 

TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG AGTTTACCAC TCCCTATCAG 300 

TGATAGAGAA AAGTGAAAGT CGAGTTTACC ACTCCCTATC AGTGATAGAG AAAAGTGAAA 360 

25 GTCGAGCTCG GTACCCGGGT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 420 

TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 480 

AAGACACCGG GACCGATCCA GCCTCCGCGG CCCCGAATTC 520 



30 



(2) INFORMATION FOR SEQ ID NO: 8: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human cytomegalovirus 

(B) STRAIN: K12, Towne 

45 . (ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 382.. 450 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

50 GAATTCCTCG AGTTTACCAC TCCCTATCAG TGATAGAGAA AAGTGAAAGT CGAGTTTACC 60 

ACTCCCTATC AGTGATAGAG AAAAGTGAAA GTCGAGTTTA CCACTCCCTA TCAGTGATAG 120 

AGAAAAGTGA AAGTCGAGTT TACCACTCCC TATCAGTGAT AGAGAAAAGT GAAAGTCGAG 180 

TTTACCACTC CCTATCAGTG ATAGAGAAAA GTGAAAGTCG AGTTTACCAC TCCCTATCAG 240 
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AAGTGAAAGT CGAGTTTACC ACTCCCTATC AGTGATAGAG AAAAGTGAAA 
GTCGAGCTC6 GTACCCGGGT CGAGTAGGCG TGTACGGTGG GAGGCCTATA TAAGCAGAGC 
TCGTTTAGTG AACCGTCAGA TCGCCTGGAG ACGCCATCCA CGCTGTTTTG ACCTCCATAG 
AAGACACCGG GACCGATCCA GCCTCCGCGG 



300 
360 
420 
450 



10 



15 



20 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii ) MOLECULE TYPE: DNA (genomic) 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human cytomegalovirus 

(B) STRAIN: Towne 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 382.. 450 



25 



30 



(xi) 

GAATTCCTCG 
GGTAAACTCG 
TTTTCTCTAT 
GGGAGTGGTA 
TTTCACTTTT 
CTGATAGGGA 
TCGTTTAGTG 
AAGACACCGG 



SEQUENCE DESCRIPTION 
ACCCGGGTAC CGAGCTCGAC 
ACTTTCACTT TTCTCTATCA 
CACTGATAGG GAGTGGTAAA 
AACTCGACTT TCACTTTTCT 
CTCTATCACT GATAGGGAGT 
GTGGTAAACT CGAGTAGGCG 
AACCGTCAGA TCGCCTGGAG 
GACCGATCCA GCCTCCGCGG 



SEQ ID NO: 9: 
TTTCACTTTT CTCTATCACT 
CTGATAGGGA GTGGTAAACT 
CTCGACTTTC ACTTTTCTCT 
CTATCACTGA TAGGGAGTGG 
GGTAAACTCG ACTTTCACTT 
TGTACGGTGG GAGGCCTATA 
ACGCCATCCA CGCTGTTTTG 



GATAGGGAGT 
CGACTTTCAC 
ATCACTGATA 
TAAACTCGAC 
TTCTCTATCA 
TAAGCAGAGC 
ACCTCCATAG 



60 



120 
180 



240 



300 



360 



420 



450 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 398 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA (genomic) 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Herpes Simplex Virus 

(B) STRAIN: KOS 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

GAGCTCGACT TTCACTTTTC TCTATCACTG ATAGGGAGTG GTAAACTCGA CTTTCACTTT 60 

TCTCTATCAC TGATAGGGAG TGGTAAACTC GACTTTCACT TTTCTCTATC ACTGATAGGG 120 

AGTGGTAAAC TCGACTTTCA CTTTTCTCTA TCACTGATAG GGAGTGGTAA ACTCGACTTT 180 

CACTTTTCTC TATCACTGAT AGGGAGTGGT AAACTCGACT TTCACTTTTC TCTATCACTG 240 

10 ATAGGGAGTG GTAAACTCGA CTTTCACTTT TCTCTATCAC TGATAGGGAG TGGTAAACTC 300 

GAGATCCGGC GAATTCGAAC ACGCAGATGC AGTCGGGGCG GCGCGGTCCG AGGTCCACTT 360 

CGCATATTAA GGTGACGCGT GTGGCCTCGA ACACCGAG 398 

(2) INFORMATION FOR SEQ ID NO: 11: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
ACTTTATCAC TGATAAACAA ACTTATCAGT GATAAAGA 33 

25 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
ACTCTATCAT TGATAGAGTT CCCTATCAGT GATAGAGA 38 



35 



(2) INFORMATION FOR SEQ ID NO:13: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 
AGCTTATCAT CGATAAGCTA GTTTATCACA GTTAAATT 

10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 
,5 (C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
20 ACTCTATCAT TGATAGGGAA CTCTATCAAT GATAGGGA 

(2) INFORMATION FOR SEQ ID NO: 15: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 
25 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
AATCTATCAC TGATAGAGTA CCCTATCATC GATAGAGA 



30 



(2) 



INFORMATION FOR SEQ ID NO: 16: 



(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

40 (ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID 
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ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 48 
Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 96 
Leu Asn Glu Val Gly lie Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

AAG CTA GGT GTA GAG CAG CCT ACA TTG TAT TGG CAT GTA AAA AAT AAG 144 
Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 192 
Arg Ala Leu Leu Asp Ala Leu Ala lie Glu Met Leu Asp Arg His His 
50 55 60 

ACT CAC TTT TGC CCT TTA GAA GGG GAA AGC TGG CAA GAT TTT TTA CGT 240 
Thr His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC GAT GGA 288 
Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asp Gly 
85 90 95 

GCA AAA GTA CAT TTA GGT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 336 
Ala Lys Val His Leu Gly Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 384 
Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 432 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
130 135 140 

GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 480 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 528 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala lie Glu Leu 
165 170 175 

TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 576 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC 621 
lie lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(xi , SEQUENCE DESCRIPTION: SEQ IP NO: 17: 
Het Ser Arg « ASP Ser Lys Val lie - Ser Ala - Glu « 

5 l Asn Glu Val Gly He Glu Gly Leu Thr Thr Ar;, Lys Leu Ma Gin 
20 25 

w . « «, « - «. - * - * - "S Ly< "° Lys 

10 35 

» ,i a[ill Ala He Glu Met Leu Asp Arg His His 
. Arg Ala Leu Leu Asp Ala Leu Aia ne 6Q 



SO 55 



15 



His Phe Cys Pro Leu Glu Gly Glu Ser Trp Gin Asp Phe Leu Arg 



65 70 



, ' T eu teu Ser His Arg Asp Gly 

Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Le ^ 

85 



20 , ^ Pro Thr Glu Lys Gin Tyr Glu Thr 

Ala Lys val His Leu Gly Thr Arg Pro Tnr « ^ 

100 w 

„ «, *» « - -» - - <*■ 01 » a ° Gly 5 8 " a ° 

25 us 

o .,,„ Ala Val Gly His Phe Thr Leu Gly Cys 
Asn Ala Leu Tyr Ala Leu Ser Ala vai y ^ 

130 135 



30 « » oi. »• «. - - - « - a 01 " olu *" Glu * 

145 150 

pro Thr Thr Asp s« M P~ » » » *• « £ 
165 1/0 

35 ' , ai« Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 

Phe Asp His Gin Gly Ala Glu Pro Aia l90 



180 185 



XI. lie Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 



40 "5 200 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 
45 ( A ) LENGTH: 6^1 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS: double 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l8: 
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ATG TCT AGA TTA GAT AAA AGT AAA GTG ATT AAC AGC GCA TTA GAG CTG 48 
Met Ser Arg Leu Asp Lys Ser Lys Val lie Asn Ser Ala Leu Glu Leu 
15 10 15 

5 CTT AAT GAG GTC GGA ATC GAA GGT TTA ACA ACC CGT AAA CTC GCC CAG 96 
Leu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 
20 25 30 

AAG CTA GGT GTA GAG CAG CCT ACA CTG TAT TGG CAT GTA AAA AAT AAG 144 
10 Lys Leu Gly Val Glu Gin Pro Thr Leu Tyr Trp His Val Lys Asn Lys 
35 40 45 

CGG GCT TTG CTC GAC GCC TTA GCC ATT GAG ATG TTA GAT AGG CAC CAT 192 
Arg Ala Leu Leu Asp Ala Leu Ala He Glu Net Leu Asp Arg His His 
15 50 55 60 

ACT CAC TTT TGC CCT TTA AAA GGG GAA AGC TGG CAA GAT TTT TTA CGC 240 
Thr His Phe Cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 75 80 

20 

AAT AAG GCT AAA AGT TTT AGA TGT GCT TTA CTA AGT CAT CGC AAT GGA 288 
Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 
85 90 95 

25 GCA AAA GTA CAT TCA GAT ACA CGG CCT ACA GAA AAA CAG TAT GAA ACT 336 
Ala Lys Val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 
100 105 110 

CTC GAA AAT CAA TTA GCC TTT TTA TGC CAA CAA GGT TTT TCA CTA GAG 384 
30 Leu Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
115 120 125 

AAT GCA TTA TAT GCA CTC AGC GCT GTG GGG CAT TTT ACT TTA GGT TGC 432 
Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 
35 130 135 140 

GTA TTG GAA GAT CAA GAG CAT CAA GTC GCT AAA GAA GAA AGG GAA ACA 480 
Val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 150 155 160 

40 

CCT ACT ACT GAT AGT ATG CCG CCA TTA TTA CGA CAA GCT ATC GAA TTA 528 
Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 
165 170 175 

45 TTT GAT CAC CAA GGT GCA GAG CCA GCC TTC TTA TTC GGC CTT GAA TTG 576 
Phe Asp His Gin Gly Ala Glu Pro Ala Phe Leu Phe Gly Leu Glu Leu 
180 185 190 

ATC ATA TGC GGA TTA GAA AAA CAA CTT AAA TGT GAA AGT GGG TCC 621 
50 He He Cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
195 200 205 



(2) INFORMATION FOR SEQ ID NO: 19: 

55 (i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 207 amino acids 
<B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
Met Ser Arg Leu Asp Lys Ser ,ys val He Asn Ser Ala Leu «Oj Leu 
1 5 10 

5 ,eu Asn Glu Val Gly He Glu Gly Leu Thr Thr Arg Lys Leu Ala Gin 



20 25 



Lys L eu Gly val Glu Gin Pro Thr Leu Tyr Trp Hie Val Lys Asn Lys 



10 35 40 



Ala Leu Leu Asp Ala Leu Ala He Glu Met Leu Asp Arg His His 



50 55 



15 Thr His Phe cys Pro Leu Lys Gly Glu Ser Trp Gin Asp Phe Leu Arg 
65 70 

Asn Lys Ala Lys Ser Phe Arg Cys Ala Leu Leu Ser His Arg Asn Gly 

85 90 
^ Lys val His Ser Asp Thr Arg Pro Thr Glu Lys Gin Tyr Glu Thr 



20 



100 105 



^ Glu Asn Gin Leu Ala Phe Leu Cys Gin Gin Gly Phe Ser Leu Glu 
25 us 120 



Asn Ala Leu Tyr Ala Leu Ser Ala Val Gly His Phe Thr Leu Gly Cys 

130 135 
30 val Leu Glu Asp Gin Glu His Gin Val Ala Lys Glu Glu Arg Glu Thr 
145 "0 "5 

Pro Thr Thr Asp Ser Met Pro Pro Leu Leu Arg Gin Ala He Glu Leu 



165 



35 



Pne Asp His Gin Gly Ala Glu Pro Ala Pne Leu Phe Gly Leu Glu Leu 



180 185 



lie lie cys Gly Leu Glu Lys Gin Leu Lys Cys Glu Ser Gly Ser 
40 200 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 
45 ( A ) LENGTH: 192 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

5Q (ii) MOLECULE TYPE: CDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 
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GAC ATG GAA AAA GCG ACA CCG GAG ACG ATG GTC CAT TGG ATT TGT CTG 
Asp Met Glu Lys Ala Thr Pro Glu Thr Met Val His Trp He Cys Leu 
1.5 10 15 



48 



AAG ATG GAG CCA GOT CTG TGG ATG GCC ATT ACA GCA ACA TCG CAC GGC 
Lys Met Glu Pro Ala Leu Trp Met Ala He Thr Ala Thr Ser His Gly 
20 25 30 



96 



GCA AGG CAC AGG ACA TTC GTC GGG TTT TCC GGC TGC CTC CAC CGC AAA 
10 Ala Arg His Arg Thr Phe Val Gly Phe Ser Gly Cys Leu His Arg Lys 
35 40 45 



144 



15 



TCC CTC ACG TAC CCA GTG ATA TGC CTG AGC AAA CCG AGC CAG AGG ATT 
Ser Leu Thr Tyr Pro Val He Cys Leu Ser Lys Pro Ser Gin Arg He 
50 55 60 



192 



(2) INFORMATION FOR SEQ ID NO:21: 

20 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 64 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



Asp Met Glu Lys Ala Thr Pro Glu Thr Met Val His Trp He Cys Leu 
1 5 10 is 

30 Lys Met Glu Pro Ala Leu Trp Met Ala He Thr Ala Thr Ser His Gly 
20 25 30 



35 



Ala Arg His Arg Thr Phe Val Gly Phe Ser Gly Cys Leu His Arg Lys 
35 40 45 

Ser Leu Thr Tyr Pro Val He Cys Leu Ser Lys Pro Ser Gin Arg He 
50 55 60 



(2) INFORMATION FOR SEQ ID NO: 22: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 816 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNBSS : double 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

50 CTG GAC GAC TCG AAG CGC GTA GCC AAG CGG AAG CTG ATC GAG GAG AAC 48 
Leu Asp Asp Ser Lys Arg Val Ala Lys Arg Lys Leu He Glu Glu Asn 
1 5 10 15 
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s=s==s==== £ ==? ss . 
5 --==s=ss====== = = 

35 40 

„_ gg- AGC CAC TGG AAG CAG AGG AGG 

15 lys Phe Leu Leu Glu Asp He Gly Gin Ser Fro ^ 
65 70 

nTvn ana TTC AGC GAG TTT ACA AAA ATC 

£2222222 s * - ol * ~ L sJ 

20 65 

lie Thr Pro Ala lie Thr Arg Val Vai Asp ^ 
100 105 
_ _ rrr TGC GAG GAT CAG ATC ATC CTG CTG AAG GGC 

S - »• ~ - - «- ° ly 

_ ™* r n r GTG CGC TAC GAC CCC 

130 135 



25 



96 



144 



192 



240 



288 



336 



384 



432 



»n/* r*m "LCC CTG AGC GGG GAA ATG GCC GTC AAA CGC GAG 
GAG AGC GAA ACG CTG ACG CTG AGC GGG Ma ^ Lyg ^ g Glu 

Glu Ser Glu Thr Leu Thr Leu Ser G±y ±6Q 
145 150 

s=s=ss=s======== 

165 170 

nova tyt GCC TTC AAC CTG GAC GAC ACC GAG GTG GCC 

180 185 

^ r-rr pre ATG TCC TCA GAC CGG ACG GGG CTG ATC 

Z 2 2 2 2 2 2 2 £ « «p »* « «t - »• 

195 200 

. -s=5S=S5===s ==== 

210 215 

— - .—"-JSSSSS2SSSS 

55 . Glu His Tyr lie Asn Tyr Arg Lys His Asn ix^ ^ 
225 230 



35 



40 



45 



50 



480 



528 



576 



624 



672 



720 
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AAG CTG CTG ATG AAG GTG GCG GAC CTG CGC ATG ATC GGC GCC TAC CAC 
Lys Leu Leu Met Lys Val Ala Asp Leu Arg Met He Gly Ala Tyr His 
245 250 255 



768 



GCC AGC CGC TTC CTG CAC ATG AAG GTG GAG TGC CCC ACC GAG CTC TCC 
Ala Ser Arg Phe Leu His Met Lys Val Glu Cys Pro Thr Glu Leu Ser 
260 265 270 



816 



10 (2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 272 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 



Leu Asp Asp Ser Lys Arg Val Ala Lys Arg Lys Leu He Glu Glu Asn 
1 5 10 15 

20 

Arg Glu Arg Arg Arg Lys Glu Glu Met He Lys Ser Leu Gin His Arg 
20 25 30 

Pro Ser Pro Ser Ala Glu Glu Trp Glu Leu He His Val Val Thr Glu 
25 35 40 45 

Ala His Arg Ser Thr Asn Ala Gin Gly Ser His Trp Lys Gin Arg Arg 
50 55 60 

30 Lys Phe Leu Leu Glu Asp He Gly Gin Ser Pro Met Ala Ser Met Leu 
6S 70 75 80 

Asp Gly Asp Lys Val Asp Leu Glu Ala Phe Ser Glu Phe Thr Lys He 

35 

He Thr Pro Ala He Thr Arg Val Val Asp Phe Ala Lys Asn Leu Pro 
100 105 * no 

Met Phe Ser Glu Leu Pro Cys Glu Asp Gin He He Leu Leu Lys Gly 
40 115 120 125 

Cys Cys Met Glu He Met Ser Leu Arg Ala Ala Val Arg Tyr Asp Pro 
130 135 140 

45 Glu Ser Glu Thr Leu Thr Leu Ser Gly Glu Met Ala Val Lys Arg Glu 
145 ISO 155 160 

Gin Leu Lys Asn Gly Gly Leu Gly Val Val Ser Asp Ala He Phe Asp 
165 170 175 

50 

Leu Gly Lys Ser Leu Ser Ala Phe Asn Leu Asp Asp Thr Glu Val Ala 
180 185 iso 

Leu Leu Gin Ala Val Leu Leu Met Ser Ser Asp Arg Thr Gly Leu He 
55 195 200 205 
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„ ■, Lvs He Glu Lys Cys Oln Cftu Ser Tyr Leu Leu Ala Fhe 
Cys Val Asp Lys lJ-e uiu *>j j 22Q 

210 215 
5 Glu His Tyr He Asn Tyr Arg Lys His Asn lie Pro His Phe Trp Ser 
225 230 

. t tr»i ■ *ia Asd Leu Arg Met He Gly Ala Tyr His 
Lys Leu Leu Met Lys Val Ala Asp Leu a ^ 

245 2bU 

10 --.'•fi. tvs val Glu Cys Pro Thr Glu Leu Ser 

Ala Ser Arg Phe Leu His Met Lys val Giu y ^ 



260 265 



(2) INFORMATION FOR SEQ ID NO: 24: 

i< (i ) SEQUENCE CHARACTERISTICS: 

1J (A ) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

20 MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 
TCCCCGGGTA ACTAAGTAAG GATCC 

25 (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
30 (D) TOPOLOGY: linear 

(ii) MOLECULB TYPE: DNA 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO:25: 

35 AGTGGGTCCC CGGGTGACAT GGAA 

(2) INFORMATION FOR SEQ ID NO:26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 



40 



(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: polypeptide 
(v) FRAGMENT TYPE: internal 
45 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



25 



24 
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Ser Gly Ser Pro Gly Asp Met Glu 
X 5 



(2) INFORMATION FOR SEQ ID NO:27: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH": 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDBDNBSS: double 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
AGTGGGTCCC CGGGTCTGGA CGAC 24 



15 



(2) INFORMATION FOR SEQ ID NO:28: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: polypeptide 
(v) FRAGMENT TYPE: internal 

25 (xi) SEQUENCE DESCRIPTION: SEQ IDNO:28: 

Ser Gly Ser Pro Gly Leu Asp Asp 
1 5 
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CLAIMS 



which activates transcription in eukaryotic cells. 

2 . Ti«n«deicacidof claim l.^totopo^**^* 1 * 1 ^ 

acid substitution compared to a wild-type Tet repressor. 

acid addition or deletion compared to a wild-type Tet repressor. 

!' • ™r having an amino acid substitution at at least one amino aad posmon 

compris.meanm.oaddsequencecorrespondm^ 

7 Thenucleicacidofcl^ 

activation domain of herpes simplex virion protein 16. 

25 8 Tnenucleica«dofclaim7,w^ 

addsequenceconesponding to positions 208to335ofSEQIDNO:2. 

30 9 The nucleic acidofclaimT.wh^inme transcription 
Lone^ofaC-terminalregionofh^ 
amino acid sequence shown in SEQ ID NO: 4. 

10 Thenucleicacidofcla^ 

Tnlaprolme-rich^ptionactivationdomama 

Ovation domain and a glutamine-rich transcription activation domain. 
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1 1 . The nucleic acid molecule of claim 1 , wherein the fusion protein further comprises an 
operatively linked third polypeptide which promotes transport of the fusion protein to a cell 
nucleus. 

5 12. A fusion protein which activates transcription comprising a first polypeptide which 
binds to a tet operator sequence in the presence of tetracycline or a tetracycline analogue 
operatively linked to a second polypeptide which activates transcription in eukaryotic cells. 

13. The fusion protein of claim 1 2, wherein the first polypeptide is a mutated Tet 
10 repressor. 

14. The fusion protein of claim 12, wherein the second polypeptide comprises a 
transcription activation domain of herpes simplex virion protein 16. 

15 15. The fusion protein of claim 12, which comprises an amino acid sequence shown in 
SEQIDNO:2. 

16. A recombinant vector comprising the nucleic acid molecule of claim 1 in a form 
suitable for expression of the fusion protein in a host cell. 

20 

17. The recombinant vector of claim 16, wherein expression of the fusion protein is 
regulated by at least one tissue-specific regulatory element. 

1 8. The recombinant vector of claim 1 6, wherein expression of the fusion protein is 
25 regulated by at least one tet operator sequence. 

19. A host cell comprising the recombinant vector of claim 16. 

20. The host cell of claim 1 9, further comprising a nucleotide sequence to be transcribed 
30 operatively linked to at least one tet operator sequence. 

21 . The host cell of claim 20, wherein the nucleotide sequence to be transcribed is an * 
exogenous nucleotide sequence introduced into the host cell. 

35 22. The host cell of claim 20, wherein the nucleotide sequence to be transcribed is an 
endogenous nucleotide sequence to which at least one tet operator sequence has been 
operatively linked. 



23. The host cell of claim 20, which is a mammalian cell. 
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5 • ,;„„ of the nucleotide sequence operatively linked 

doxycycline or cyanotetiacycline. 




• «f Maim 29 wherein the transgene is integrated at 
30 The non-human transgenic organun of claim 29, where 

„ n tn.ns.enic organism of claim 29, further comprising a nucleotide 
. nntn ^ eem c organism of c^^^ 

H mm ^»fc*33,^ to ^^^^«-' di - 
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operator sequence operatively linked to a heterologous second polypeptide which inhibits 
transcription in eukaryotic cells. 

36. The nucleic acid of claim 35, wherein the first polypeptide binds to a tet operator 
5 sequence in the absence but not the presence of tetracycline or a tetracycline analogue. 

37. The nucleic acid of claim 36, wherein the first polypeptide is a Tet repressor. 

38. The nucleic acid of claim 37, wherein the first polypeptide comprises an amino acid 
10 sequence shown in SEQ ID NO: 17. 

39. The nucleic acid of claim 35, wherein the first polypeptide binds to a tet operator 
sequence in the presence but not the absence of tetracycline or a tetracycline analogue. 

1 5 40. The nucleic acid of claim 39, wherein the first polypeptide is a mutated Tet repressor. 

4 1 . The nucleic acid of claim 40, wherein the mutated Tet repressor has at least one 
amino acid substitution compared to a wild-type Tet repressor. 

20 42. The nucleic acid of claim 40, wherein the mutated Tet repressor has at least one 
amino acid addition or deletion compared to a wild-type Tet repressor. 

43. The nucleic acid of claim 41 , wherein the mutated Tet repressor has an amino acid 
substitution at at least one amino acid position corresponding to an amino acid position 

25 selected from the group consisting of position 71, position 95, position 101 and position 102 
of a wild-type TnlO-derived Tet repressor amino acid sequence. 

44. The nucleic acid of claim 43, wherein the mutated Tet repressor comprises an amino 
acid sequence shown in SEQ ID NO: 19. 

30 

45. The nucleic acid of claim 35, wherein the second polypeptide comprises a 
transcription silencer domain of a v-erbA oncogene product. 

46. The nucleic acid of claim 45, wherein the second polypeptide comprises an amino 
35 acid sequence shown in SEQ ID NO: 23 . 

47. The nucleic acid of claim 35, wherein the second polypeptide comprises a 
transcription silencer domain of a Drosophila Kmeppel protein. 
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acid sequence shown in SEQ IDNO-21. 

, • .„Mrf claim 35 wherein the second polypeptide comprises a 
49. The nucle.0 acui of d» consisiting of the retinoK 
< ^nscriDtion silencer domain of aprotem selected rr * 



the Drosophila engrailed protein, E4BP4 and ZF5. 

0 , , t,w, m K wherein the fusion protein further comprises 

50. Thenucleicaddmole^eofclaunB J fmefeion protein to a 

an operatively linked third polypept.de wh,ch promotes transport 

cell nucleus. 

. , . « • . . Wfc won^criDtion in eukaryotic cells, comprising a first 

sequence shown in SEQ ED NO: 17. 

sequence shown in SEQ ID NO: 19. 

30 56 Tl.^^ofd-.Sl.^^i^P'^"^* 
transcription silencer domain of a v-erbA oncogene product. 

57 . r**^***^*^**^"*"' 9 ** 

35 . acid sequence shown in SEQ ID NO: 23. 

58 The fusion protein of claim 5 1 , wherein the second polypeptide comprises a 
transcription silencer domain of a Drosophila Krueppelprotem. 
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59. The fusion protein of claim 5 8, wherein the second polypeptide comprises an amino 
acid sequence shown in SEQ ID NO: 21 . 

60. The fusion protein of claim 5 1 , wherein the second polypeptide comprises a 

5 transcription silencer domain of a protein selected from the group consisiting of the retinoic 
acid receptor alpha, the thyroid hormone receptor alpha, the yeast Ssn6/Tupl protein 
complex, the Drosophila protein even-skipped, SIR1, NePl, the Drosophila dorsal protein, 
TSF3, SFI, the Drosophila hunchback protein, the Drosophila knirps protein, WT1 , Oct-2.1 , 
the Drosophila engrailed protein, E4BP4 and ZF5. 

10 

61 . A recombinant vector comprising the nucleic acid molecule of claim 35 in a form 
suitable for expression of the fusion protein in a host cell. 

62. A host cell comprising the recombinant vector of claim 61 . 

15 

63. The host cell of claim 62, further comprising a nucleotide sequence to be transcribed 
operatively linked to at least one tet operator sequence. 

64. The host cell of claim 63, wherein the nucleotide sequence to be transcribed is an 
20 exogenous nucleotide sequence introduced into the host cell. 

65. The host cell of claim 63, wherein the nucleotide sequence to be transcribed is an 
endogenous nucleotide sequence to which at least one tet operator sequence has been 
operatively linked. 

25 

66. A method for modulating transcription of the nucleotide sequence operatively linked 
to the at least one tet operator sequence in the host cell of claim 63, comprising modulating 
the concentration of tetracycline or a tetracycline analogue that is in contact with the host 
cell. 

30 

67. A non-human transgenic organism comprising a transgene comprising the nucleic 
acid molecule of claim 35 in a form suitable for expression of the fusion protein in cells of 
the non-human transgenic organism. 

35 68. The non-human transgenic organim of claim 67, wherein the transgene is integrated at 
a predetermined location within the genome of the organism. 

69. The non-human transgenic organism of claim 67, further comprising a nucleotide 
sequence to be transcribed operatively linked to at least one tet operator sequence. 
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a second transgene encoding a second fusion protein which inhibits 
transcription, the second fusion protein comprising a third polypeptide which binds to a tet 
operator sequence operatively linked to a fourth polypeptide which inhibits transcription in 
eukaryotic cells; and 

5 a third transgene comprising a nucleotide sequence to be transcribed 

operatively linked to at least one tet operator sequence. 

77. The non-human transgenic animal of claim 76, wherein the first polypeptide binds to 
a tet operator sequence in the presence, but not the absence of tetracycline or a tetracycline 

1 0 analogue and the third polypeptide binds to a tet operator sequence in the absence, but not the 
presence of tetracycline or a tetracycline analogue. 

78. The non-human transgenic animal of claim 76, wherein the first polypeptide binds to 
a tet operator sequence in the absence, but not the presence of tetracycline or a tetracycline 

1 5 analogue and the third polypeptide binds to a tet operator sequence in the presence, but not 
the absence, of tetracycline or a tetracycline analogue. 

79. A recombinant vector for coordinate, bidirectional transcription of a first and a second 
nucleotide sequence to be transcribed, the vector comprising a nucleotide seiquence 

20 comprising in a 5* to 3' direction: 

a first cloning site for introduction of a first nucleotide sequence to be transcribed, 
which is operatively linked to 

at least one tet operator sequence, which is operatively linked to 
25 a second cloning site for introduction of a second nucleotide sequence to be 

transcribed, 

wherein transcription of the first and second nucleotide sequence introduced into the vector 
proceeds in opposite directions relative to the at least one tet operator sequence. 

30 

80. The recombinant vector of claim 79, wherein the first and second nucleotide sequence 
to be transcribed have been introduced into the first and second cloning sites, respectively. 

8 1 . The recombinant vector of claim 80, wherein the first and second nucleotide 
3 5 sequences encodes polypeptide subunits of a heterodimeric protein. 



82. The recombinant vector of claim 80, wherein the first nucleotide sequence encodes a 
protein of interest and the second nucleotide sequence encodes a detectable marker. 
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FIG. 2 
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FIG.3 
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FIG. 4A 
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FIG. 7A 
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